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A SECOND PIEBALD FAMILY FROM SUFFOLK. 

Bt E. a COCKAYNE, D.M.. F.R.C.P. 

In 1914 I published an account of a piebald family*, which originally came 
from the neighborhood of Bury St Edmunds in Suffolk. About a year ago at the 
Hospital for Sick Children, Great Ormond Street, I saw two piebald children, 
members of a family whose ancestors lived near Ipswich in Suffolk. It has been 
impossible to trace any relatioi^hip between these two families, nor even to find 
a surname common to both. In both of them there has been descent through 
females and neither knows the name of the member who first came to London. It 
is, however, most improbable that two unrelated families with so rare a condition 
lived in the same county. The only other piebald family recorded in England is 
the London one described by Bishop Harman in 1909 and this may be descended 
from the same stock. 

I am only able to give a short and incomplete pedigree, because the members 
of the family, whom I saw, have lost touch with their cousins and it. has been 
impossible to communicate with them. 

I, 2. Had a white frontal blaze. 

II, 1. Has dark hair and is normal, and has eight or nine normal children. 

II, 5. Has a white frontal blaze and is childless. 

II, 9. Has a white frontal blaze, but all her four children are normal, and one 
of her daughters has a number of children and grandchildren, who are likewise 
normal. 

II, 11. Piebald. 

III, 1 0. Aged 43. Has very fair hair with distinct white frontal blaze of medium 
size. The inner third of the left eyebrow is white. Almost the whole of the skin 
on the front of the chest and the greater part of that on the abdomen is unpig- 
mented and was severely reddened and blistered by the sun years ago, when he 
was in Malta and Corfu. Large areas of skin on the backs of both arms above the 
elbows and on the legs round the knees and upper part of the calves is also 
unpigmented, and there are smaller patches on the forearms and backs of the legs. 

III, 12. Normal with rather dark brown hair. 

IV, 2. Aged 18 years. Is dark haired and normal. 

IV, 3. Aged 14 years. Has blue eyes and very fair hair. She has a large 
white frontal blaze and a triangular area of unpigmented skin on the forehead, 
which ends above the root of the nose. The whole of the skin on the posterior 
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sarfkoe of tbe lower two-thirds of both upper arms is unpigmented and ends 
sharply in a curved line with its convexity upwarda There are smaller and less 
clearly defined patches on the forearma The skin over the knees and for a short 
distance above and a greater distance below them is also unpigmented, and forms 
a large oval well-defined area on the front of each leg. There are other similar 
patches on the upper part of the calves. The unpigmented skin becomes bright 
red when exposed to the sun, whereas the pigmented skin turns brown, but in 
winter little difference can be seen l^tween them. A large part of the skin of the 
chest and abdomen appears to be unpigmented, but has never been exposed to 
sunlight. 

IV, 4. Aged 9 years. Is very fair with blue eyes. She has a much smaller 
frontal blaze than her sister, but the V-shaped area of unpigmented skin on her 
forehead was very clearly defined, when I saw her, as it was still bright red from 
exposure to the sun. The unpigmented skin ends a short distance above the root 
of the nose and there are no hairs in her eyebrows or eyelashes. The skin over the 
elbows and on the backs of the arms above them is unpigmented and separated 
from the normal skin by a curved line, like that in her sister* but situated nearer 
to the elbow. The skin over the knees and above and below them is unpigmented, 
and these areas are similar in shape and size to those in her sister. There are 
other patches of unpigmented skin on the forearms and on the legs, especially on 
the upper part of the calves. The skin on the chest and abdomen has never been 
exposed to the sun, and it is impossible to be sure that any of it is unpigmented, 
though a large part of it appears to be. 

III, 14. 1 have been unable to see this man, but a photograph 1 have seen 
shows a small white frontal blaze. His brother says that he has other unpigmented 
areas similar in distribution to his own. His wife, HI, 13, has rather dark brown 
hair and brown eyes. 

IV, 6. Aged 13 years. Is dark and normal. 

IV, 6. Aged 10 years. Has grey eyes and fair hair with a very large white 
frontal blaze. The skin of the forehead and front of the neck is unpigmented. 
The greater part of the skin of the right arm over the deltoid, that of the inner 
side of the upper arm and along the radial side of the forearm almost as far as the 
wrist is unpigmented, and there is an area of unpigmented skin over the elbow. 
On the left arm the skin over the deltoid is unpigmented, and there is another 
unpigmented area along the radial side of the forearm. 

IV, 7 and 8. Aged 5 and 3 years respectively. Grey eyes cmd light brown hair. 
Normal. 

IV, 9. Aged 2 years. Very fair hair and grey eyes. There is a small white 
frontal blaze and the skin of the upper part of the forehead is unpigmented. On 
other parts of the body the skin is so fair that it is impossible to distinguish the 
unpigmented from the normal skin. 

HI, 15. The twin brother of III, 14 died at birth, and it is not known whether 
he was piebald or not. 

1—2 



4 Second Suffolk Piehodd Family 

III, 16. Died in infancy. It is not known whether he was piebald or normal. 

Ill, 17. Piebald. Died unmarried. 

II, 13. Piebald. The condition of her children. III, 18 to 22, and of her grand- 
child, IV, 10, is not known. 

In connection with the family reported in 1914 I should like to state that 
another piebald boy, a younger brother of VI, 2, was born soon after the publication 
of the paper. Unlike his brother he had very dark brown hair and his frontal blaze 
was smaller. On p. 200 I said that it was probable that in some members of the 
family at least the areas of unpigmented skin were larger and more numerous 
than I had stated. Proof of this was obtained in the case of the boy VI, 2. I saw 
him after he had been bathing at the seaside and found that he had extensive 
areas of unpigmented skin on the chest, smaller ones on the abdomen and flanks, 
several spots of unpigmented skin on his penis, and large patches on the legs as 
well as on the arms, so that it is probable that other members of the family had 
white patches on the legs, though they said that this wtis not so. There is little 
doubt therefore that the distribution of unpigmented skin is very similar in these 
two families, as one would expect if they have a common origin, though none of 
the mcmbei's of the family described in this paper has heterochromia iridis. They 
are like the majority of hereditary and isolated cases described by others in having 
the dorsal region fully pigmented and in possessing a white frontal blaze. Jenks 
has described a family of American whites of Welsh-Scottish origin*, in which the 
piebald pattern, transmitted by direct descent, differed very much from the usual 
one. All the piebald members had a stripe of unpigmented skin running down the 
mid-dorsal region and most of them had no frontal blaze in spite of having very 
extensive areas of unpigmented skin on the trunk and limbs. 

In this family as in other published cases, with the exception of the two families 
recorded by Meirowsky and Spickernagel, and by Meirowsky respectively •f, the 
condition has only been transmitted by those affected. Assuming that the con- 
dition is due to a single dominant gene, and that the piebald parents were all 
heterozygous, the expectation is that the piebald and normal members will be 
equal in number. In the family described in this paper in five complete sibships 
with 26 members, 11 were piebald, 13 were normal, and in 2 the condition is not 
known, and in the five sibships of the family described in 1914 with 28 members, 
15 were piebald and 13 normal. The numbers agree closely with those expected, 
if the piebald condition is due to a dominant gene. If, however, the full sibships 
in this family and the ten others I collected in my book are taken together, there 
is a slight excess of piebalds, 144 piebalds and 131 normal members. The two 
exceptional families, in which normal individuals are said to have transmitted the 
condition, and the one described by Jenks, are omitted from these figures. Con- 
sidering the sex incidence in the same eleven families, there were 79 males and 
81 females with the piebald pattern, so that both sexes are about equally affected. 

* Jenks, American Anthropologist , Vol. xvi. 1914, p. 221. 

t B. A. Cockayne, Inherited Abnormalities of the Skin and its Appendages^ 1988, p. 54. (Bibliography.) 
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PEDIGREES OF HEREDITARY EPICANTHUS. 

Bv C. H. USHER. M.B. 


In 1828 Schon* described the coadition now known as epicanthns* Von 
Ammon in 1831 named it and in 1860 he classihed it. His classification is given 
in my* paper on a pedigree of epi can thus and ptosis in the Annals of Eugenics'^, The 
epicanthic fold is represented in all races during foetal life J. Waardenburg§ found 
epicanthus present from the beginning of the third to the beginning of the sixth 
month of pregnancy. In his embryos it was never present as a fold covering the 
canthus medialis. Eislerjl states that the human foetus of 28-30 cm. has an epi- 
canthus which coalesces only with the upper lid fold to form the typical mongolian 
fold. In the first six months of life* according to Martin’s statistics 51 for epicanthus 
in the Munich population, 331 of males and 32*6 7o females have epicanthus, 
but the numbers are re<luced to 3*3 % 2*6 7© respectively in the years 12 to 25, 

thus showing that in most cases an epicanthus disappears in early life. In con- 
sequence peculiar difficulties are encountered in the investigation of a pedigree of 
simple epicanthus and these are added to by the circumstance that individuals 
who have epicanthus and their relations are usually unaware that an abnormal 
condition is present. This is in marked contrast with the characteristic cases of 
ptosis with epicanthus which are well known to the relatives. Clinical observation 
sometimes shows the presence of an epicanthus at the first examination and its 
absence at a subsequent examination. Thus: (1) A female (Fig. 21, II, 1, p. 15) 
at age of 5 years had well-marked bilateral epicanthus, which had disappeared by 
the age of 24. (2) Another female (Fig. 12, II, 8, p. 11) child when seen at 5 years 
of age had a unilateral epicanthus, which was itot present 21 years later. (3) A male 
(Fig. 17, III, 3, p. 13) with bilateral epicanthus in infancy had at the age of 7 no 
epicanthus on right side, but the remains of one on left side. (4) A female when 
seen in infancy and again at 5 years of age had bilateral epicanthus, which was 
not present at the age of 9. 

Published pedigrees of simple epicanthus are few. I know of ten of which 
Meirowsky's is the only large one, though in a pedigree by Rosenstein, epicanthus 
occurs in four generations. The first of these ten pedigrees to appear was V. Ammon’s 
published in 1860. 

* Sohdn, Matth. Joh. Albrecht, Handhuch der pathologischen Anatomic/ de$ menachlichen Aug^, 
Hamburg, S. — 60, 1828. 

t Annals of Eugenics^ Vol. i. Parts i and ii. pp. 128 — 188, 1925. 

X Keith, Human Embryology and Morphology^ 5th ed. p. 244, 1988. 

§ Waardenburg, P. J., Qraefss Arch. /. Ophth. S. 124, 221—229, 1980. 

II Eisler, P., Kurzes Handbuch der Ophthalmologie (Sobieok u. Briickner) i. S. 280, 1980. 

IT Martin, Budolf, Lehrbueh der Anthropologic, Jena, S. 425, 1924. 



6 Hereditary Epicanthus 

V. Ammon, Fig. 1, observed a son and three daughters with a small tarsal 
epicanthus which they had inherited from their mother. 
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Fig. 1. 

Dhews, Fig. 2, knew a family in Munich in which a mother, her son and 
daughter had epicanthus. 
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Fig. 2. 


Galezowski, Fig. 3, in 1875 wrote that epicanthus is most frequently double, 
and it is congenital. He had seen it hereditary in a family ; the father and the 
two children presented the same deformity in an excessive degree. 
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Fig. 8. 


Manz, Fig. 4, saw five children in a family of ten with epicanthus.' The other 
five children did not have the deformity. Sex and order of birth not given. No 
mention of eye movements. 
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Fig. 4. 
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ScHBlDT^ Fig. 5, observed partial lid fold and epicanthus in four of six children 
of a &mily of Alpine descent, whose parents, grandparents and brother and sister 


were not affected. 
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Rosenstein, Ist family, Fig. 6, IV, 2, a girl, age 9, with bilateral epiccuithus 
and strabismus, H. 6 D. in right and 4 D. in left eye and 1 D. of astigmatism in 
each. Asymmetry of face. IV, 1, her brother, age 13, with bilateral epicanthus 
and concomitant convergent strabismus. H. R. 3 D., L. 4 D. with 1 D. of astigmatism 
in each. No facial asymmetry. The mother. III, 1, age 43, has bilateral epicanthus. 
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JS 

Fig. 6. 

H. 6 to 7 D., astigmatism 1*5 D. in each. Very marked facial asymmetry. II, 1, the 
maternal grandmother, always had bad vision, bridge of nose was flat with vertical 
skin fold on both sides. I, 1, the great grandmother, had always poor vision, also 
epicanthus and facial asymmetry. There is direct transmission of epicanthus in 
four generations. 

Meibowsky, Fig, 7 (p, 8), In generation I a female married twice. From the first 
marriage she had a son and daughter both with epicanthus. The son had nine 
children, all males, of whom six had the hereditary character. All affected members 
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who toarried transmitted epioanthus to some of their descendants, whilst the 
unaffected members had children without epicanthus. In the same manner the 
three children of the second marriage of whom two were affected, both males, 
transmitted the affection to two further generations. Of the 87 descendants of 
the female in generation I, 31 have epicanthus. 

Bosenstein, 2nd family, Fig. 8, III, 1, a male, age 29, with bilateral epicanthus 
and a high degree of myopia, B. 13 and L. 16, astigmatism of 1 D. in each and 
blepharochalasis of left. Ill, 2, his twin brother, with a less degree of myopia, had 
the same condition. Ill, 3, the youngest brother, a 14-year-old boy, who died of 
inflammation of the lungs, was short-sighted and had exceptionally well-marked 
bilateral epicanthus. Ill, 4, two unaffected brothers. II, 1, the father, died. He 
had a high degree of short sight and bilateral epicanthus. It is not known whether 
any of the grandparents (1, 1 — 4) had a similar anomaly. There is direct inheritance 
of epicanthus with other anomalies of the eye from father to three sons. 

r ,6^ iQ? . 0 ’ 
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Fig. 8. 

Bosenstein, 3rd family, Fig. 9, III, 1, a 1 5-year-old boy with bilateral epicanthus 
and myopia, R. 5 D., L. 7 D., with astigmatism of 1 D. in each. He is an only child. 
II, 1, his father, died. He had bilateral epicanthus and very poor vision both far 
and near. It is not known whether the boy’s grandparents (I, 1 and 2) or near 
relations had a similar anomaly. There is direct inheritance of the deformity from 
father to son. 
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Fig. 9. 
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Waabdenburo, Fig. 10, observed epicaothus in five brothers. Their father did 
not have the character, but their mother had it in a slight degree. 

6 
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Fig. 10. 

In these 10 pedigrees are 69 persons with epicanthus ; 35 are males, 23 are 
females, and in 11 the sex is not recorded. In 8 pedigrees (Figs. 1 — 3, 6 — 10) 
the inheritance is continuous with transmission through the male thirteen times 
and through the female nine times. In 2 pedigrees (Figs. 4 and 5) epicanthus 
is confined to a single sibship. Consanguinity is not recorded in any of the 
pedigrees. 

In submitting the results of an investigation of some new pedigrees of simple 
epicanthus — 24 in all — I am fully aware how unsatisfactory these are in some of 
the pedigrees, owing to the probable disappearance of the epicanthus in certain 
cases, but I venture to record them in the hope that when considered collectively 
some light may be thrown on such pedigrees*. 

My own material consists of epicanthus pedigrees which show heredity. In 
some of these, investigation originated from clinical notes made in some instances 
many years ago, and in others from cases which Professor Lowf had seen from one 
to three days after birth in the years 1 924 — 27. With few exceptions, only examined 
cases of epicanthus occurring in the pedigrees have been accepted. None of the 
pedigrees is large. Examples have already been given of the disappearance of 
epicanthus. In addition to these proved cases, in some pedigrees individuals 
without an epicanthus are suspected of formerly having had this character. The 
three youngest members of a sibship, in Fig. 11, have bilateral epicanthus, but 

.6 
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Fig. 11. 

two older ones when they were examined had no epicanthus. The first-born, not 
seen, had died in infancy. The distribution of the abnormality is suggestive that 

* Some of these pedigrees are briefly described on pp. 10 — 18, and fuller particulars of each 
individual pedigree are provided on pp. 18 — 26. 

t Professor Alexander Low, University of Aberdeen. 
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one or more of the three first bom sibs may have had epicanthus in infiumy. Their 
lather has congenital oolobona of iris and microoomea. 

(The + sign above individuals in the pedigree cheurts indicates that they have 
been examinecL) 

In Fig. 12 are two sisters with affected offspring. One of them, II, 8, had 
unilateral epicanthus at the age of 5 years, whereas there was no epicanthus 

C^clrl' . 



when she was 26 years of age. An older sister, II, 6, with four affected children, 
has no epicanthus. She had not been seen in childhood, and similarly with her 
mother (not shown in abbreviated chart), I, 2. It may also be of significance that 
only the youngest of the four affected children of II, 5 has bilateral epicanthus, in 
the others it is unilateral 


In Fig. 13, a mother with two children, all showing well-marked bilateral 
epicanthus, said that her mot|ier, I, 1, had the same condition. Her medical man, 



m ,9 

Fig. 18 . 

however, reported that she has no traces of epicanthus. Had it possibly dis- 
appeared ? 

In Fig. 14 are two sisters with affected offspring. One of them, II, 6, has a 
definite epicanthus on left side only. Her sister, II, 4, who is older than she is, 
has no epicanthus. Their parents, I, 1 and 2, have no epicanthus. The questions 
arise : first, had II, 6 bilateral epicanthus at an earlier age and, secondly, did 11, 4 
and I, 1 or I, 2 formerly have epicanthus? 
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Hereditary Epicanthiis 
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Fig. 14. 


In Fig. 15 a male, IV, 3, with bilateral epicanthus in infancy and at age of 5 
has now, .at the age of 9, what appears to be only the upper part of each epicanthus 
remaining. The boy s maternal aunt. III, 13, age 12, has a bilateral fold resembling 
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Fig. 16. 


the upper part of an epicanthus, which does not extend down so far as the internal 
tarsal ligament. She is marked in the chart as h.aving no epicanthus, though 
possibly, as in the case of IV, 3, a definite bilateral epicanthus was present earlier. 
These cases (IV, 3 and III, 13) and also the case of a male, II, 3 in Fig. 29, 
suggest that the epicanthus disappears from below upwards. In the mother’s 
sibship, epicanthus occurs only amongst the younger members (Fig. 15, III, 12 
and 14), so that the mother at their age may have had the same condition. 

The next four pedigrees of epicanthus in three generations illustrate direct 
inheritance of epicanthus. Fig. 16, a male, I, 2, is dead, but his photograph shows 
epicanthus. He married twice and transmitted the condition to a member of both 
his families. His daughter, II, 2, and her son. III, 3, have epicanthus. It is 
unknown whether II, 1, 3 and 4, three unaffected parents who have children with 
epicanthus, had the same condition in childhood. 
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Fig. 17. Albert S., Ill, 1, age 7, with remaiDS of an epicanthus on left side 
only, in infancy had bilateral epicanthus. His brother, III, 3, and sisters, III, 2 
and 4, have bilateral epicanthus, which is most conspicuous in the youngest sister, 
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Fig. 17. 

aged 2. The father, II, 1, has unilateral epicanthus on left side. The mother, II, 2, 
illegitimate, has distinct bilateral epicanthus. The paternal grandparents, I, 1 and 
2. have no epicanthus. Maternal grandfather, I, 3, is dead. Maternal grandmother, 
I, 6, has feebly marked bilateral epicanthus; I, 6, three females, are dead. I, 4 
and 7 and II, 4 have no epicanthus. All those affected in this pedigree have 
epicanthus palpebralis. 

Fig. 18 shows a pedigree with epicanthus in three generations and direct 
transmission of the affection. 
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Fig. 18. 
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Fig. 19. Epicanthus in three generations. Of 11 members who were examined 
in a sibship of 11, ten have epicanthus. In three of these, II, 1, 8 and 10, the 



Fig. 19. 


epicanthus is unilateral, it has an attenuated appearance and disappears at 
lower edge of the internal tarsal ligament. Their mother, I, 2, has bilateral epi- 
canthus. Ill, 4 has left unilateral epicanthus ; III, 5 and 6 bilateral epicanthus 
palpebralis, and III, 7 and 8 bilateral epicanthus tarsalis. 

An example of what appears to be discontinuous inheritance is seen in Fig. 20. 
I, 1, now deceased, is said to have had no epicanthus, but was blind from inflamed 
eyes in infancy. II, 4 when examined had no epicanthus, which may however have 
been present in childhood. Other examples are seen in Figs. 14 and 16 on pp. 12, 13. 
It is important to recollect that if certain parents in these pedigrees had been 
examined in childhood, epicanthus might have been found, in which case the 
inheritance would be continuous. 

I .Q 

n .9 4> .6 ‘9 'f 

m ,6 .6 


Fig. 20. 


The next three pedigrees show the abnormality limited to members of a single 
sibship. 

Fig. 21 shows two normal parents with six children, of whom two males and 
two females have epicanthus. 
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Fig. 22. A normal woman, I, 2, had a normal child by her first husband and 
nine children by her second husband, of whom seven have epicanthus. 
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Fig. 23. Four affected sons of normal parents. A fifth son, 11, 1, the oldest, be 
it noted, has no epicanthus. Other five sibs, 11, 2, 3, 6, 7, 10, died in infancy. 

I .6^ 

Fig. 28. 

Unilateral epicanthus may result from the natural disappearance of an epi- 
canthus from the opposite side. The artificial disappearance of an epicanthus may 
be the result of operation. If the epicanthus be removed at an early age from both 
sides it is conceivable that in adult life such an individual would be unaware of 
his previous condition, which might never have been reported to the investigator 
of the pedigrees. 

Examples of persistence of an epicanthus for a considerable number of years 
are seen in the next four pedigrees, two of which do not show heredity. 
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Fig. 24. Dorothy B. T., II, 4, at age of 3 had a bilateral epicanthus extending 
from tarsal fold and curving downwards to inner end of lower lid, which was still 
present at age of 28. 
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Fig. 24. 


Fig. 25. Margaret S., II, 7, a twin, at age of 9 had a unilateral epicanthus 
which was still present when she was 19 years of age. 
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Fig. 26. II, 4, a male, the first born of triplets, examined at 5 and 28 years of age. 
Bilateral epicanthus was present on both occasions. 
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Fig. 26. 


In pedigree. Fig. 27, five of six sisters with epicanthus were re-examined 24 years 
later. In four of them epicanthus was still present. The oldest of these was 33. 

The following results have been obtained from the 22 new pedigrees, of which 
a number of charts have been shown. Consanguinity occurs in one pedigree. The 
number of individuals with epicanthus is ] 11, of whom 53 are males and 58 females. 
Transmission from parent to child occurs in 13 pedigrees, by the father in 6 sib- 
ships and by the mother in 14. Discontinuous inheritance occurs in 8 pedigrees, 
including 5 of those that also show continuous inheritance. Epicanthus is confined 
to members of a single sibship in 6 pedigrees. Though epicanthus is transmitted 
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from an affected father, or mother, to the children in only a small majority of 
the pedigrees, transmission is in accordance with what is seen so markedly in 
Meirowsky's large pedigree. Furthermore, as epicanthus is known to disappear 
as age advances in a considerable number of people — see Martin’s table — there 
can be little doubt that some of the remaining 9 pedigrees would have shown 
direct inheritance had all the examinations been made at an early age. When 
these results are added to those from published pedigrees, the following figures 
are obtained : Number of affected persons 180 ; of these 88 are males, 81 females, 
and in 11 the sex is unrecorded. There are 32 pe^ligrees in all*, in 21 of these 
inheritance is continuous, with transmission through males 19 times, and through 
females 24 times. Discontinuous inheritance alone occurs in 3 pedigrees, and 
both continuous and discontinuous inheritance in 7. When it is considered that 
21 of the 32 pedigrees show transmission from parent to child and that a pro- 
portion of normal parents of affected sibships may in childhood have had the 
abnormality, there is good reason to believe that usually in inherited epicanthus 
the transmission is continuous. 

Changes must occur in the appearance of an epicanthus, which, though well 
marked at first, finally disappears. No doubt ophthalmologists see the different 
stages of retrogression, but arc the appearances presented recognised as those of 
an epicanthus which will later disappear? The ill-defined skin folds which in some 
instances are difficult to diagnose as epicanthus are possibly late stages of a 
formerly well-defined epicanthus. Cases have been recorded in this memoir of well- 
marked epicanthus, which after some years appeared to be represented by merely 
short thin skin folds at the site of the upper part of the vanished epicanthus. 
Forster'f had doubt as to whether an obliquely lying skin fold which began at the 
internal tarsal ligament and extended upwards and outwards above the tarsal fold 
was an epicanthus, though he regarded the fold as probably an epicanthus. He 
saw this oblique fold in five eases, aged respectively 23 (2), 20, 37, 19 years. It 
may be suggested that these were possibly late stages in the disappearance of an 
epicanthus palpebralis. The age of the cases supports such a view. It is obvious 
that a thorough investigation of the inheritance of epicanthus in a pedigree re- 
quires not only an examination of each individual, but that this should be made 
at an early age. 

SiMPLifi Epicanthus. 

A short Account of 22 unpublished Pedigrees which show Heredity, 

Fig. 12 (p. 11), Forbes family. Epicanthus in three sibships of two generations. 
Wm., Ill, 20, age 15, Isobel, III, 23, and Catherine, III, 24 have left unilateral 
epicanthus; Ann, III, 26, has bilateral epicanthus more marked on left side. Their 
mother, II, 5, has no epicanthus. One of the twins. III, 21 and 22, died at birth, 
the other and Patricia, III, 25, and Alice, III, 27, have no epicanthus. Douglas Y., 

* Figs. 26 and 26 are not inoluded in this statistical summary as the author considers that they do 
not show heredity. 

t Forster, A., Anatomiacher Anzeiger, Bd. 62, No. 3/4, S. 49 — 68, 1919. 
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III, 28, age 4, and his brother William, III, 29, age 11 weeks, have bilateral 
epicanthus. Their mother, II, 8, has no epioanthas (1934), yet when seen (1918) 
at the age of 5, epicanthus on left side was recorded, right side was normal. I, 1, 
fftther of II, 8, and II, 7, deceased. Besides those marked in the chart, 20 other 
members were examined, including font females the oldest in sibship II, 1 — 8 and 
their children, and also I, 2, the mother of II, 8. None of these had epicanthus. 
No consanguinity. 

Fig. 21 (p. 16), Gerrie family. Four with epicanthus in a sibship of six. Mary G., 
II, 1, in 1914, at age of 5, had marked bilateral epicanthus which was absent in 
1984. Mrs L., II, 2, and her child. III, 1, have no epicanthus. Alice G., II, 3, age 
21 years, has epicanthus on right side, none on left, extending from tarsal fold 
above to just below inner end of lower lid. Richard G., II, 6, age 15, with bilateral 
epicanthus; distance from one epicanthus edge to that of the other is 38 mm. and 
from outer canthus to edge of epicanthus is 25 mm. James G., 11, 6, age 13, has 
bilateral epicanthus ; distance from edge of one epicanthus to that of the other is 
38 ram. and from outer canthus to edge of epicanthus is 23 mm. I, 1 h^ no 
epicanthus. It is not stated whether I, 2, now dead, who came with a daughter in 
1914, was examined for epicanthus, though this is likely. No consanguinity. 

Fig. 28, Mack family. Ill, 4, male, age 3 (1905) was brought for a definite 
but not very marked bilateral epicanthus. In 1911, R. and L. fundus normal, 
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refraction H. in each R.V. with with 

mentally deficient. His mother, II, 3, now deceased, had an epicanthus on right 
side only. She stated that it was a family complaint and that her sister’s children, 
III, 5, and 6, have it. When the mother, II, 5, of these children was written to 
in 1934 she replied that it does not exist in herself or her daughters, nor in any of 
her brothers and sisters, nor in any member of her brother L.’8, II, 8, family, but in 
her granddaughter, lY, 2, 16 months old, she could see a faint indication of it. 

2—2 
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Maternal grandfather, I, 3, deceased. Maternal grandmother, I, 6, has feebly marked 
bilateral epicanthas. I, 6 are deceased. I, 4 and 7 and II, 4 are normal Epi- 
canthus palpebralis was the form in each affected person. No consanguinity. 

Fig. 11 (p. 10), Findlay pedigree. Epicanthus in the three youngest of six 
sisters. II, 6, Daisy F., bilateral epicanthus seen at 6 months and years of age. 

II, 4, Jessie F., and II, 6, Hose F., age 1 year, have bilateral epicanthus. II, 1 died 
in infancy; Marjory F., II, 2, age 12, and Dorothy F., II, 3, are normal. Father, 
I, 1, and mother, 1, 2, have no epicanthus. The former has congenital coloboma at 
nasal side of right iris and microcornea of same eye. R, V. with high power -f lens 
— reading small newspaper pi-int at 10". Seen in bed with cardiac disease. No 
consanguinity. 

Fig, 13 (p. 11), Cleveland pedigree. Shows direct transmission from mother to 
children. The family lives in Aberdeen. Sheila C., Ill, 1, firstborn, age 11, and 
Jill C., Ill, 2, have no epicanthus. Jaqueline C., Ill, 3, age 6, and III, 4, an 
infant, have marked bilateral epicanthus. Mother, II, 2, has definite bilateral 
epicanthus. She stated that her mother, I, 1, who lives in England, has the same 
condition, but this was not confirmed by the family doctor. No consanguinity. 

Fig. 14 (p. 12), Brown pedigree. James B., Ill, 4, age 7, illegitimate, and 

III, 5, 6, 7 have bilateral epicanthus tarsalis. Mother, II, 4, has no epicanthus. 
Father, II, 1, is abroad, II, 2 not seen. Ill, 8, Adam B., age 7, illegitimate, has 
bilateral epicanthus tarsalis. His mother, II, 6, has it on left side only. II, 6, II, 3, 
and III, 1, 2, 3 have no epicanthus. Ill, 1 is aged 6. No consanguinity. I, 1 
and I, 2 have no epicanthus. Had II, 5 bilateral epicanthus at an earlier age and 
did II, 4 formerly have epicanthus ? 

Fig. 15 (p. 12), Poison pedigree. A boy and two maternal uncles with epi- 
canthus. Parents, III, 2 and 5, and maternal grandmother, II, 3, have no epicanthus. 
Gordon H., Ill, 12, age 16, has a minor dfjgree of epicanthus bilateralis. Neil H., 

III, 14, also has bilateral epicanthus better defined on right side. Lizzie H., Ill, 8, 
has broad root of nose, but no fold at either inner canthus. Paternal grandmother, 
II, 1, very broad from one inner canthus to the other, no epicanthus, skin wrinkled. 

IV, 4 and 5, no epicanthus. IV, 6, III, 6, and IV, 1 died in infancy. Ill, 9 married, 
no offspring. 

Fig. 30, Calder pedigree. Showing both direct and indirect inheritance. Nan C., 
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III, 1, age 9, illegitimate ; her mother, II, 4, and uncle, II, 3, unmarried, have 
bilateral epicanthus tarsalis. II, 4, not consanguineous with II, 1, or her husband, 

II, 2, by whom she has two normal sons. III, 2, 3. II, 5, with no epicanthus, has 
three sons, one (III, 4) of whom was killed. The other two have epicanthus tarsalis, 
bilateral in 111, 5, age 7, and unilateral on right side in III, 6, age 3. I, 1 and 2, 
deceased. It is of course open to doubt whether II, 5, at an earlier age, had 
epicanthus. No consanguinity. 

Fig. 20 (p. 14), Jamieson pedigree. Epicanthus in a child, her paternal uncle 
and a paternal aunt. Disappearance of an epicanthus from operation. Jas. J., II, 3, 
age 28, unmarried, at age of 3 had marked bilateral epicanthus which covered the 
caruncles ; distance from edge of one epicanthus to that of the other was 26 mm. 
Excellent result after Wicherkiewicz operation on each. Annie J., II, 7, age 12, 
with unilateral left-sided epicanthus, has internal concomitant strabismus. Irene J., 

III, 3, age 2, with unilateral left-sided epicanthus. Father, II, 4, has no epicanthus. 

I, 1, decetised, his wife states had no skin fold, worked at blind asylum all his life, 
eyes inflamed in infancy. No consanguinity. 

Fig. 26 (p. 16), Nicol pedigree. Epicanthus in four members of a sibship of 
seven. II, 4, 5, 6 are triplets, age 28; of these, the first born, II, 4, and the last 
born, II, 6, have bilateral epicanthus, which was first seen in II, 4 when he was 
examined at 5 years of age. He was brought for divergent squint which passed 
off ; refraction of each eye was H- 1 D. Their mother says that the two triplets, 

II, 4 and II, 6, were remarkably alike. II, 5 has no trace of epicanthus. II, 2, 
female, ago 30, has marked epicanthus on left side ; on right side is merely a trace. 
In all these cases the epicanthus arises from the upper lid tarsal fold. II, 1, 
male, age 31, not seen, but, to judge by the account of his brother, II, 4, and a 
photograph, he has bilateral epicanthus best marked on left side. Father, I, 1, 
deceased. Mother, I, 2, has no epicanthus. II, 3 died in infancy ; II, 7 is abroad, 
age 26 ; photograph shows no epicanthus. No consanguinity. 

Fig. 22 (p. 15), Maitland pedigree. Normal parents with seven children in a 
sibship of nine with epicanthus. Mary M., II, 2, age 13, with bilateral epicanthus. 
II, 3 died in infancy. II, 4, age 9, has feebly marked bilateral epicanthus, nose 
well developed. Father says the skin folds were formerly more marked in her 
than in any of the others. John M., II, 5, Edward M., II, 6, and Daisy M., II, 7, 
have bilateral epicanthus. James M., II, 8, is normal. Margaret M., II, 9, and 
Francos M., II, 10, age 7 months, have bilateral epicanthus. These are Mrs M.’s, 
I, 2, children by her .second marriage with M., I, 3. By her first marriage with W., 
I, 1, she had a son, T. W., II, 1, an only child without epicanthus. Neither he nor 
his parents have epicanthus. Both I, 2 and I, 3 are normal. No consanguinity. 

Fig. 18 (p. 13). Lemmon pedigree. Showing continuous transmission in three 
generations. I, 8 and his daughter, II, 13, have bilateral epicanthus, and his 
grandson, III, 11, right-sided unilateral epicanthus. II, 19, when brought for 
internal concomitant strabismus when 5 years of age (1916), had well-marked 
bilateral epicanthus, which is apparently unchanged (1934). It covers the plica 
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semilunaris and caruncle, arises from tarsal fold of skin on upper lid, and is 
attached below to the skin down and in from inner end of lower lid. 1, 5, Mrs G., 
with bilateral epicanthus, had an illegitimate daughter, II, 9, by a half-caste, 1, 4 ; 
she married I, 6, but had no more children. II, 9 and her children. III, 6, 7, 8, 
have no epicanthus. II, 21, the youngest in the sibship, is 19 years of age. Abroad; 

I, 7; II, 2, 4, 10. Married without issue: I, 6; II, 6, 7, 10. Deceased: I, 1, 2, 11; 

II, 6, 11, 12, 14, 16. 17, 18; III, 5, 9. No consanguinity. Borne of those referred 
to are not shown in the abbreviated chart. 


Fig. 31, Dunbar pedigree. Ill, 4, Susan D., age 14 months, with well-marked 
bilateral epicanthus and alternating internal strabismus, is the child of a man, 
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Fig. 81. 


II, 2, age 49, with congenital club foot, and a woman, II, 3, with bilateral epi- 
canthus. By his first wife, II, 1, he, II, 2, had three normal children, III, 1, 2, 3, 
ages 16, 14, 10 years. I, 1 and 2 are dead; no information regarding the presence 
of epicanthus in them is obtainable. No consanguinity. 

Fig. 23 (p. 15), Stewart family. Epicanthus in four brothers in a sibship of ten. 
II, 9, George S., age 6 (1932), with marked bilateral epicanthus and right con- 
vergent internal strabismus. When seen again two years later, it was noted that 
the epicanthus covered the plica semilunaris and caruncle. It extended from tarsal 
fold of upper lid to below inner end of lower Ikl. II, 8, Gordon S., age 11, bilateral 
epicanthus more marked on left side. II, 5, Robert S., age 14, bilateral epicanthus 
better marked on left side. II, 4, Wm. S., poorly marked bilateral epicanthus. Died 
in infancy: II, 2, 3, 6, 7, 10. Parents, I, 1 and 2, normal. No consanguinity. 

Fig. 32, Massie pedigree. A mother and two sons with bilateral epicanthus. 
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II, 1, Alexander M., age 10, and his brother Jim, II, 2, age 8, to a less extent, 
have bilateral epicanthus. The tarsal fold is continued downwards to level of lower 
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lid margin, where it ends on side of nose. Their mother, I, 2, presents the same 
appearances, caruncle and plica normally developed. Father, I, 1, has no epi- 
canthus. No consanguinity. 

Fig. 33. Rey pedigree. Bilateral congenital epicanthus in a female, her niece, 
and paternal grandmother. Ill, 9, A. R., female, age 22, has well-marked bilateral 
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epicanthus continuous with tarsal fold of upper lid, no ptosis, caruncle visible, 
lower punctum not displaced outwards. None of her three brothers, III, 6, 10, 11, 
or two sisters. III, 7, 8, shows the anomaly, but her niece, IV, 3, has definite 
bilateral epicanthus continuous with tarsal fold of upper lid. I, 2, her paternal 
grandmother (not seen), now dead, had exactly the same condition. When A. R., 
Ill, 9, was born, the then family doctor (deceased) reminded her mother, II, 7, 
of the condition in I, 2. II, 7 says that her mother-in-law was able to stretch the 
epicanthus by raising her brows. She, II, 7, and her husband (deceased) had 
no epicanthus. She was fifth born in a sibship of eight ; neither of her 
parents nor any progeny of her brothers and sisters (not shown in chart) had 
epicanthus. Ill, 1 and the elder of the two sisters, III, 4, are married, without 
issue. No consanguinity. 

Fig. 34, Donald pedigree. Father and two sons wdth bilateral epicanthus. II, 1, 
age 3J, has also left internal concomitant strabismus. In the father, the epicanthus 
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begins below the tarsal fold of upper lid and curves downwards to a point below 
the inner canthus. It has a more solid appearance than usual. 
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Fig. 27 (p. 17), Gray pedigree. Six sisters with epicanthus. Be-examination 
of five of them after an interval of 24 years. Ill, 30, Mary G., age 16 (1912), had 
a minor degree of epicanthus more marked on left side, which had disappeared by 
1934; bridge of nose well developed. Ill, 32, Joan G., age 9 (1910), brought for 
blepharo-conjunctivitis, had well-marked bilateral epicanthus which had not altered 
by 1934. Ill, 33, Jeannie G., age 8 (1912), with bilateral epicanthus which had 
not altered by 1934. Her daughter has no epicanthus. Ill, 36, Daisy G., age 6 
(1912), with bilateral epicanthus, which by 1934 had disappeared from left side 
and was fully marked on right side. Ill, 36, Annie G., age 4 (1912), with bilateral 
epicanthus which was still present in 1934. Ill, 37, Williamina G., age 2 (1912), 
with unilateral left-sided epicanthus, in 1934 was abroad. Ill, 31, male, died age 2 
(not seen), had no epicanthus. Ill, 34 and 38 have no epicanthus. None of those 
affected in the sibship is married except III, 33. Both parents, II, 6 and II, 8, 
are normal. The mother, II, 8, now dead, said in 1912 that her husband's sister, 
II, 1, deceased, and one of her daughters had the same condition as her own 
daughters. This could not be confirmed in 1934 as the three daughters, III, 4, 6, 

II, had then no epicanthus. A fourth daughter. III, 12, died young. I, 1, female, 
age 94 (1912), knew nothing of the condition in any of her relatives when inter- 
viewed by II, 8. No consanguinity. 

Fig. 19 (p. 14), McLean pedigree. Sixteen affected members in three generations. 
Transmission continuous through male or female. In a sibship of 11, epicanthus 
bilateralis was present in seven; in one there was no epicanthus, and in three 
(II, 1, 8, 10) only feebly marked unilateral epicanthus was found. I, 1, age 66, is 
normal; I, 2 has bilateral epicanthus palpebralis which has a flattened appearance, 
skin is thin. Ill, 2, one died in infancy, three still-born. Ill, 1, age 10. Ill, 3, 
age 3. HI, 4, age 10 months, has left unilateral epicanthus. Ill, 5, age 3 years, and 

III, 6, age 4 months, have bilateral epicanthus palpebralis. Ill, 7 and III, 8 have 
bilateral epicanthus tarsalis, and so have II, 5 and II, 7, but not so markedly. No 
consanguinity. 
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GROWTH CHANGES IN BODILY SIZE AND PROPORTIONS 
DURING THE FIRST THREE YEARS: A DEVELOPMENTAL 
STUDY OF SIXTY-ONE CHILDREN BY REPEATED 
MEASUREMENTS. 

By nancy BAYLEY and FRANK C. DAVIS. 

(University of California, Institute of Child Welfare.) 

T. INTRODtTCTION. 

Investigators of the problems of growth changes in bodily size and proportion 
have frequently pointed out that seriatim measures, in a group large enough for 
statistical treatment, are greatly to be desired. Such measures over the entire 
growth period would, of course, require many years; in consequence, most seriatim 
studies cover only a few years of the growth cycle, or are limited to few cases. These 
studies have usually been limited further by restriction to measurements of height 
and weight only. 

The present study presents cumulative data for the first three years of life. 
During these early years growth is, of course, much more rapid than at any later 
age, and consequently a greater portion of it is included for a given unit of time. 

The use of a constant sample through a three-year period presents many 
advantages. With a constant sample we need not make allowances for sampling 
fluctuations from one age level to the next, and so may work with a smaller number 
of cases than would otherwise be necessary for securing growth and variability trends. 
From repeated measures, we can determine the stability of growth rates in 
individuals. We can study the relationship between growth rates of different parts of 
the body, for the group as a whole, and for individual members in the group. We can 
study the consistency in types of build between the earlier and later ages measured. 
Finally, as a result of these, we may learn, in part, the extent to which one may 
predict adult size and body build in infancy. 

If, as a number of students of body build have claimed, build and certain 
mental characteristics are related, findings of inconstant body characteristics would 
suggest a search for concomitant variations in pei*sonality traits. 

Anthropometric studies of infants are attended with certain serious difficulties. 
One of these is the problem of securing normal unselected samples. This difficulty 
is decreasing in recent years, with the prevalence of Well-baby clinics, and the 
direction of attention to the importance of infant health and welfare. Another 
difficulty lies in the comparative unreliability of anthropometric measures of infants. 
Infant measures are less reliable than those of adults because they are smaller; the 
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importance of chance variations is thus comparatively greater, even though the 
absolute value of these variations may be the same. But an even more important 
factor is that of the emotional reactions of the in&nte to the procedures of 
measuring. Before they reach an age where they can understand what is wanted 
of them and are no longer afraid of the measuring instruments, many infisints 
protest, more or less vehemently, against being held in position for the measures 
and against the application of the instruments. Besides this is the fact that the 
examiner is usually not well known to the child, and that children between the ages 
of six months and two years or so of age usually dislike being handled by strangers. 
The result is that the measures must at times be made on a tense, struggling, crying 
child. This situation odds greatly to the difficulty of making accurate, reliable 
measures. Often the measure must be caught in a fleeting moment when the child 
is in the correct position, and comparatively quiescent. As Baldwin (5) has pointed 
out, however, much can be done by the examiner, in the manner of his approach, 
to put the child at his ease and so secure more accurate measures*. 

Since normal samples of healthy infants are now available, cmd the examiner 
may learn a method of approach which diminishes the child's resistance to being 
measured, the only remaining question is that of the reliability of the measures, as 
such. Bakwin and Bakwind), who recently made a large number of measures ou 
infants under one year, repeated a series of measurements on the same children 
and found what they felt was a high degree of reliability — at least as reliable as 
Todd (88) and others have found for adults. For ten repeated measurements of from 
11 to 20 infants on total body length, sitting height, diameter of face, bicristal 
diameter, biacromial diameter and circumference of thorax at nipples, the coefficients 
of variation ranged from ’04 per cent, for length to 1*3 per cent, for bicristal 
diameter f. It seems possible, then, that the disturbing factors can be overcome, 
and that a study based on the anthropometric measurements of infants may be of 
some significance. 

We are quite familiar, now, with the difference between infants and adults in 
body proportion, and some of the changes they undergo in the process of maturation. 
The relatively greater maturity of the upper end at birth, and the cephalo-caudal 
trend of development is true, in general, of body proportions (Qodin(iO), Bardeen(7), 
Bean<8)), as it is of neural and motor development (Coghill, Shirley). However, 
there is still much to learn about the relative growth rates of the diiferent parts, 
especially in the same individuals. Seriatim measures have not hitherto been made 
on a sufficiently large number of infants to allow adequate statistical treatment. 
Valuable studies of proportional changes with growth (based, for the most part, on 
samples which are made up of different individuals at successive ages) have been 
made by Scammon and Calkins (85) for foetal growth, Scammon and others ( 84 ) for 

* This does not neoessarilj mean oar]:7ing on oonTersations with a cdiild as Gray and A^res (88) 
have inferred, bnt is often a matter of the examiner’s attitude, which may vary according to the child. 

t Their data have not, however, been treated in such a way as to make possible comparisons between 
remeasnres on the $ame individual, since the measures on all infants have been grouped together. 
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post-natal growth, and Boasdo) for rates of growth in height and weight. Baldwin 
( 5 ) (p. 107) has done this for a large number of seriatim measures on a few children. 
Godin ( 19 ) has shown diagram matically the changing proportions from the foetus to 
the adult ; the foetus has a proportionately large head and short extremities, 
especially legs, and the relative proportions of these change strikingly with growth. 

Most of the above-mentioned studies have been concerned wnth the rates of 
growth and changes in proportion to the various body parts. A closely related field 
is that of the consistency in growth rates within the individual. Baldwin (6) has 
found, for example, that children who are tall for their age continue to be tall for 
their age as they grow oldi^r, while short children remain relatively short. Boas (10), 
on the other hand, has found recently that persons of the same adult height reach 
that height by different growth rates, and with different ages of most rapid growth. 
Hence, “acceleration or retardation within the same social group has apparently 
no influence upon adult stature.” Morrell ( 81 ), studying rabbits, also found no rela- 
tionship between rapid tiarly growth and adult size. Studies of consistency must, 
necessarily, be made from seriatim measures, and must concern themselves with 
the independent growth curves of individuals. 

Another interesting field for investigation is that of body build. Only a few 
studies of build have been made on infants. Bakwin and Bakwin ( 4 ) claim to have 
found a twofold division of body types in infants. Lcderer ( 29 ) has made an 
elaborate classification of four types, giving them names which imply characteristic 
accompanying behaviour patterns or mental traits. There is considerable disagree- 
ment among investigators of body build in Jidults, as to whether there are distinct 
types, and, if so, what the types are. We may expect to find the same disagreement 
in respect to infants, but if we can choose a criterion (or criteria) for measuring 
individual differences in infants’ build w^e may then determine whether those 
differences remain constant, and whether they are related to other mental or 
physical characteristics. 

The height and weight of infants under one year have often been measured; 
and norms based on a large number of measures have been presented. But addi- 
tional anthropometric measures on such young subjects are few, though there have 
been several recent studies of infants in which a large number of measures were 
made. Probably the most complete series of measures has befen reported by the 
Iowa Child Welfare Research Station (20). Recent studies by Freeman (iSa, igb) 
and Freeman and Platt (l8c) give means, s.D.'s and indices for a large number of 
measures made at frequent age levels on children from birth to 14 years. Other 
studies are those of Bakwin and Bakwin (i, 2, 8), of Boyd at Minnesota, and of 
Thompson at the Yale Psycho-Clinic. The results of these last two studies have not 
yet been published. Other studies, not quite so recent, are those of Fleischner ( 18 ), 
Montague and Hollingworth ( 82 ), and Taylor ( 86 ). 
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IL THE PRESENT STUDY. 

Au Sample. 

The present study was undertaken with a group of sixty-one infants who 
were measured at monthly intervals from one to twelve months, and at fifteen, 
eighteen, twenty- four, thirty and thirty-six months of age. They were measured 
oil, or within a few days of, their birth date. During the firsfr year the observations 
were made, if not on the birth date, within four days of that date. Longer devia- 
tions from the birth date were allowed later, if necessaiy (in the third year as much 
as two weeks), but these wide deviations were rare. This method of seriatim 
measures made possible a comparison of growth tendencies at different stages 
of development, based on measures of a sample which was identical, or nearly so, 
throughout the series. There were occasional absences, and a number of the children 
moved out of town before the end of the three-year period. Forty-nine of them 
were still enrolled in the study at thirty-six months. 

The method of obtaining the sample caused some selection, making for a some- 
what homogeneous group, although originally an attempt was made to get babies 
from families representative of a cross-section of the community. The subjects 
observed form a large proportion of Berkeley babies born in two hospitals during 
a six -month period. Of these babies, we included only those whose parents were 
white, English-speaking, and willing to co-operate by monthly visits to the Institute 
of Child Welfare for a series of observations. These observations included, in addition 
to the anthropometric measurements, tests of mental, motor and physiological 
development. 

A comparison of this Growth Study group with a larger group of Berkeley 
families in which infants were born at about the same time*, show to what extent 
they are representative of the Berkeley population. This comparison shows that 
the families of this smaller group average higher in annual income, education, and 
occupational rating than the unselected population of families into which babies 
were born during this time in Berkeley. The mean annual income of the Growth 
Study is $2844, as compared with the Berkeley Survey mean of $2544. The mean 
educational level of the Growth Study parents is, for the fathers 13*8 years and 
for the mothers 13*1 years, while the Berkeley Survey fathers average 12*2 years 
and the mothers average 11*G years of schooling. Thirty per cent, of the fathers of 
the Growth Study infants were classified in professional or executive occupations, 
as compared with 22 per cent, of the Berkeley Survey group. An equal proportion 
of both groups were classed in “white collar” occupations, while the smaller group 
had comparatively fewer in the skilled, semi-skilled and unskilled labour categories. 

From these comparisons, it may be assumed that this group of infants comes, 
for the most part, from homes where the education and the income combine to 
insure hygienic living conditions and sufficient nourishing food; and hence, well- 
nourished, healthy babies. This selection may account, in part, for the fact that 

* The Berkeley Survey, a study made bj Frances M. Welch and Agnes Covalt. 
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the mean measures of this group of Berkeley babies are larger, after the sixth 
month, than those of other published infant norms (41). 

The group is probably also, because of similar racial stocks and freedom from 
environmental handicaps, more homogeneous than would be an unselected sample. 
For this reason many of the correlational values between measures will be more 
significant than the same values would be for a more heterogeneous group of 
infants. 

B. The Measures : Instruments and Technique. 

The measures which we undertook to make were selected on the basis of two 
criteria: the accuracy with which they could be made on infants, and their probable 
value in indicating growth trends in size and bodily proportions. The measures 
taken were length (reclining during the first eighteen months, afterwards standing); 
stem length (reclining during the first eighteen months, afterwards sitting); widths 
of shoulders, chest and hips ; chest depth ; circumferences of head, chest and 
abdomen ; head diameters, transverse and antoro-posterior ; and weight. This 
selection of measures and the methods of taking them are based, for the most 
part, on those used at the Iowa Child Welfare Research Station. As the technique 
has been described in a recent publication (26), we will indicate here only our 
deviations from it. In the present study we have not considered the abdominal 
circumference or the head diameters. 

The instruments used during the first eighteen months varied from those used 
at Iowa in several respects. Chest depth was measured with the spreading callipers; 
and the shoulder, chest and hip widths with the small sliding callipers. This 
change was made because the large sliding callipers were found to be awkward 
with the young infants. From eighteen months on, the large callipers were used 
in the chest, hip and shoulder diameters. Circumferences at all ages were measured 
with a linen tape graduated in millimetres, and having a spring attachment at one 
end by which the pressure could be kept constant*. The tapes were replaced at 
intervals to prevent inaccurate measures due to their shrinkage with use. Standing 
length and sitting stem length were measured on a stadiometer made by the 
Continental Scale Works in Chicago. Since this stadiometer was calibrated in 
inches, the measures were read in inches and quarter-inches, and converted to 
centimetres for treatment of the data. Infants under about twenty pounds were 
weighed on a light portable spring scale whose accuracy was checked at frequent 
intervals ; the heavier babies were weighed on a balance scale. Both scales were 
calibrated in pounds and ounces. These have been converted to kilograms for 
treatment of the data. All measures were made with all clothing removed. At 
about nine months, it was found to be impossible to measure most children while 
lying down, without vehement protest. For this reason, the head, chest, and 
shoulder measures were made with the child sitting, until he was able to stand 
with support, when all of the body diameters and circumferences were taken 
standing. After eighteen months, the lengths also were taken standing. 

* These are made by the Narraganset Machine Co., Providence, B.I. 
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Further difihrenees in procedure on individual measures are as follows: For 
length* of the stem in the erect posture the child sat in the stadiometer^ with 
its spine in contact with the stadiometer^ and knees flexed by placing its feet up 
on the platform in the position described by Dreyer ( 17 ). The experimenter then 
held the head in place and measured in the same way as for standing lei^gth. 
Chest depth : when the spreading callipers were used, the fixed branch was placed 
on the sternum at the nipple lin^, and the movable branch at the centre back, 
just below the scapulae. 

In order to obtain accurate measures, every effort was made to keep the child 
quiet and contented. An active restless child would not maintain the optimum 
position long enough for a measure to be read, and the chest measures of a 
violently crying child varied far more widely than those of one breathing quietly. 
The difficulty of obtaining accurate measures increased as the infants grew older 
and more active until, at eighteen or twenty-four months, they could be made 
interested in the procedure and persuaded to stsind still and submit willingly to 
being measured. The whole situation was made more difficult by the^ fact that 
mental, physical and motor teats and the taking of photographs were a part of the 
procedure during the period of observation, so that the child was often excited or 
tired before any anthropometric measures were taken. The experimenter measured 
the child with as little handling as possible, and always with a gentle friendly 
approach. But from about seven to eighteen months, the situation was complicated 
by the child’s emotional reaction — at times violent — to the strange situation and 
to being handled by strange persons. In the difficult cases the mother usually 
co-operated by holding the child in positidn. These variable factors probably re- 
sulted in some inaccuracies in measures. For this reason, measures taken under 
unfavourable conditions, when they deviated widely from other measures of the 
same child, were not used in the treatment of the data. 

Every effort was made to overcome unreliability of the measures, by carefully 
controlling the position of the child, and by securing its acquiescence whenever 
possible. Some of the measures, especially weight and height, could be made fairly 
accurately, even though the children were resistant, while others were greatly 
affected by variations in attitude toward the instruments and the examiner. An 
example of the latter is shoulder width. The child w^ho shrinks from the callipers 
and draws its shoulders together may decrease its shoulder width by several 
centimetres. On the other hand, the unreliability of a measure may be a function 
of the dimension itself. The amount of fat or the firmness of the muscles over the 
points measured, in hip width, for example, may vary enough in the same child, 
from one month to the next, to change His relative position in the group. Other 
measures, such as chest depth, can be changed by assuming different body postures 
which change the relationships of the component parts involved in the measure. 
Since children so young cannot be expected to co-operate, or even to understand 

The term **lengtli ” hea beett fubatituted for ** height,** in thia paper, to applj to both zeciining 
length and atanding height ; azkd the icrmbol Ii. ia need inatead of H. in the indioea and tables, i.e. W./L.*. 
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the positions desired by the experimenter, we may expect to find such measures 
relatively unreliable. The most reliable measures should be those taken from bony 
points without intervening joints, and with little flesh covering. Some of these 
may be read accurately, even on a protesting child. 

Most of the measures taken in this study were made by one person (Bayley), 
but in a few instances they were made by the other experimenter in the Growth 
Study, Dr L. V. Wolff. After determining a given measure, the experimenter read 
it aloud to a recorder who entered it on the record form. 

C. Ratings of Body Build from Photographs. 

In order to evaluate the validity of the different body build indices, a criterion 
was set up on the basis of photographs taken under standard conditions. The use 
of photographs made it possible to compare all of the children of a given age at 
one time. 

(a) Method of taking Photographs, 

The infants wcn^ photographed, with all clothing removed, at ages 1, 3, 6, 7, 9, 
12, 15, 18, 24, 80 and 36 months. The pictures were taken with a Graflex camera, 
indoors, with standard lighting conditions, lens at F4}'b, and a *1 second exposure. 
The infants at first lay or sat on a canvas table with the camera placed at a 
standard distance. When the children were able to stand alone, they were placed 
on “foot-prints’’ painted on the floor, with the camera at a new standard distance. 
Three exposures were always made. In the early months two exposures of the 
child in the dorsal and one in the prone position were taken. When a child was 
able to sit alonc^, one exposure in the sitting position was substituted for one of the 
dorsals. When he was able to stand alone, one front, one rear, and one side view, 
all standing, were taken. However, it was not always possible to persuade a child 
to stay where he was put for a picture, so that many of the resulting photographs 
are not exactly as desired. Usually at least one of the three was sufficiently in 
position to make possible a fair estimate of body build, while for the 30 and 
36-moiith pictures, better co-operation of the children resulted in very satisfactory 
pictures. 

(h) Method of Rating. 

For the purpose of rating body build from the photographs, the pictures wore 
sorted into groups according to age, with each child’s pictures for one age fastened 
to a single large card. The ratings were made by comparing each child with the 
others of the same age. To do this the rater sorted the cards into seven stacks, 
if that many divisions could be made, putting pictures of similar body build 
together. Each age group was rated separately without reference to other ages, 
and each characteristic considered was rated independently. Standing pictures 
were not compared with sitting, nor sitting with reclining pictures. 

A rating scale of seven different body proportions was prepared, with a seven- 
point rating for each. Since only one of these proportions — general chubbiness, 
or distribution on a lateral-linear scale — could be rated adequately from the 
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reclining and sitting pictures, it is the only one considered in this paper. Chubbi* 
ness was rated for all months at which pictures were taken, independently by two 
judges, one (JB) who had measured the infants, and one (Z)) who was not femiliar 
with the children and so was not influenced, in his ratings of the photographs, by 
any preformed opinions of a child’s build. The pictures at 15, 18 and 24 months 
were rated by the writers and by Dr L. V. Wolff. The scores used for comparison 
with the objective measures are the combined ratings of the two (or, where 
available, three) judges. 


III. RESULTS. 

A. The Reliability op the Measurements. 

The best measure of the reliability of data such as these would be that afforded 
by the coefficients of correlation from repeated measurements taken during one 
examination period. This procedure was not followed in the present instance, for 
several reasons. In the first place, the amount of time allotted for the examination 
of a given child, at any one examination period, was too brief to permit the repeti- 
tion of the several anthropometric measurements and still allow for the other 
observations which were included in the plan of testing. Again, the attempt to 
repeat the anthropometric measurements would have led (particularly at certain 
ages) to emotional disturbances in some of the infant subjects which would have 
rendered the second series difficult to obtain and often unreliable when they could 
be obtained. 

In the absence of reliability coefficients derived from repeated measurements 
during a given examination period, the reliability of the measurements reported 
upon herein may be estimated in the following ways: (1) by a study of the 
coefficients of correlation obtained when one month’s measurements are paired with 
those taken at the next succeeding examination period ; (2) by the remeasurement 
of a small group of infants after an interval of one week ; and (3), less directly, by 
a consideration of the consistency of the trends of central tendency for the group. 

(1) Low coefficients of correlation when successive monthly measurements are 
correlated with each other could be due either to errors in the taking or in the 
recording of the measurements, or to pronounced shifts, during the interval between 
the two examinations, in the relative position of the individuals composing the 
sample. High correlations, on the other hand, can mean only that the measure- 
ments were made with a relatively high degree of precision and that there is a 
definite tendency toward stability in the rank orders of the subjects from month 
to month. Table I gives the self-correlations obtained in that way. Weight, total 
body length, and head circumference are seen to be highly stable measures. Stem 
length, hip width, and chest circumference rank next. Shoulder width, chest width, 
and chest depth are relatively unreliable, and this fact needs to be taken into 
consideration when the respective merits of various proposed indices of body build 
are under consideration. It may be noted here that our results, based upon seriatim 
measurements made on living infants, are in agreement with the conclusion of 
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TABLE I. 


Self Correlations between Consecutive Measurings, 


Months 

Mean 

1x2 

2x3 

3x4 

4 x5 

6x6 

6x7 

7x8 

8x9 

9x10 

10x11 

11x12 

12x16 

15x18 

18x24 

24x30 

80x36 

f 

•909 

•945 

•931 

•947 

•961 

•943 

•954 

•901 

•mil 

•941 

•965 

•957 

•931 

•931 

•936 

•961 

•946 

•918 

•920 

•898 

•889 

•942 

•950 

•939 

•93(5 

•914 

•928 

•932 

•948 

■921 

•916 

•947 

•974 

•980 

•91« 

•934 

•937 

■920 

■954 

•942 

•941 

•870 

•916 

•911 

•922 

•024 

•919 

•912 

•904 

•950 

•924 

•772 

•847 

•821 

•761 

•879 

•830 

•800 

•875 

•887 

•904 

•883 

•803 

•871 

•78-1 

■760 

•779 

•835 

•857 

•810 

— 


•794 




•779 




•a39 

— 

•745 

•812 

•864 

•933 

•825 

•829 

•821 

— 


•705 





•881 





•837 



•802 

•823 

•839 

•884 

•819 

•709 

•603 

— 

— 

•609 





•808 





•637 

— 

•545 

•638 

•765 

•874 

•702 

•706 

•028 




•099 





•700 





•829 



•667 

•541 

•737 

•738 

•690 

•070 

•710 

— 

— 

•561 

— 

— 

•001 

— 

— 

•580 

— 

•394 

•677 

•731 

•884 

•059 

•718 

•741 

•808 

•857 

•800 

•726 

•871 

•874 

•800 

•832 

•882 

•874 

•830 

L 

•732 

•740 

•911 

•816 


Scammon and Calkins ( 85 ), based upon repeated measurements made on human 
foetuses, that total body length (crown-heel length) is a more highly reliable measure 
than stem length (crown-rump length). Boyd (is) has also found, on remeasuring 
Nursery School children, that standing height is by far the most reliable of a series 
of length measures. 

(2) As an additional check on the reliability of the measures, fifteen infants 
in a San Francisco baby home, ranging in age from nine days to seven months, 
were meiisured, and then remeasured by Bayley after an interval of one week. 
Unfortunately it was impracticable to measure more cases in this way, and since 
the number of cases is so small, the results must be considered as only tentative, 
and are interesting, mainly, because they corroborate the results of comparisons 
between measures made one month apart. Table II gives the means and s.D.'s, 
and the differences between the first and second measures. The variation is much 
greater for some measures than for others. The greatest mean difference is in 
length, the smallest in hip width. Since length is also the longest measure taken, 
its greater variation does not necessarily invalidate it. When we convert the mean 
differences into percentage of the mean of each measure (column 9) we find that 
the head measures are relatively most consistent, while stem length, total length, 
hip width and chest circumference all have mean differences of less than 2 per cent, 
of their mean size. Chest width shows the greatest percentage variation (3*6 per 
cent.), with chest depth and shoulder width also relatively unreliable. 

(3) The curves of the means for all measures are very smooth when one 
considers that they are composed of between 60 and 60 cases at each age. This is 
due, in large part, to the constant sample. Gross errors, however, either in the 
taking of the measurements or in their recording, would presumably have led to 
a greater fluctuation in the means. Such fluctuation would be especially evident 
in the means for the sexes separately. Table IV and Figs. 2 and 4 show that 
these sex differences are small but consistent. If they were due to chance errors 
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TABLE II. 


Repealbed Meaaurea of Infanta m San Franoiaoo Baby Home*. 

(Oentimetres.) 


1 

2 

3 

4 

n 

6 

D 

8 

9 


No. 

Mr 


Af, 


M 

Diff. 

a 

Diff. 

Jf Diff. 

Mr 

Length 

13 

53-95 

3-0338 

54*08 

2*9690 

*76 

•4005 

*0139 

Stem Length 

13 

35-56 

2-0269 

35-42 

2*3970 

*46 

*4568 

*0129 

Shoulder Width 

14 

16*43 

1-8941 

15-36 

2*1379 

•44 

•2660 

•0286 

Chest Width 

15 

11-37 

1*3766 

11-23 

1*1396 

•41 

•3025 

-0361 

Hip Width 

15 

11-03 

1-5098 

11-07 

1*6774 

•18 

-2067 

•0163 

ChMt Depth 

15 

10-04 

*9165 

10-06 

•8648 

•29 

•2584 

•0289 

Head Ciroumferenoe 

15 

38-22 

3-4006 

38-15 

3*4699 

-21 

•1752 

*0055 

Chest Ciroumference 

16 

35-06 

3-6619 

36-09 

3*9633 

•66 

-6530 

*0181 

Transverse Head 

13 

9*97 

•6467 

9-98 

*6182 

*05 

•0480 

•0050 

Antero-jx^sterior Head 

14 

13-04 

1*0645 

13-04 

1-0667 

•04 

•0534 

*0031 


* The ages of these children are as follows : 2 weeks, 2 oases ; 1 month, 2 oases ; 2 months, 4 oases ; 
3 months, 6 cases ; 4 months, 1 case ; and 7 months, 1 ease. 


in measurement, the direction of the difference would fluctuate, with the females 
sometimes exceeding the males, and vice versa. 

As for the reliabilities of the judges* ratings of photographs, the correlations 
between the ratings of judges B and D range from *71 to *81, with a mean of *76 
(Table III). The correlations for months 15, 18, and 24 average *85 between B 
and W, and *78 between W and D. The combined ratings of all judges were 
correlated with themselves at consecutive ages, and these r*s (Table HI), when 
the necessarily wide age intervals between pictures are considered, are satisfactory. 
They range from *65 to *93 with a mean of *81, A coefficient of correlation is, of 
course, always to be interpreted in the light of the known facts with respect to 
such factors as (1) the size of the sample, (2) the degree of heterogeneity present 
in the sample, and (3) the extent to which the sample is selected and the kind of 
selection involved. For example, a selection of children from the poorer social 
strata might include a much larger proportion of disease-affected and under- 
nourished children than would be found in a sample from the more prosperous 
classes of a population. The latter group could very well be subjected to more 
standardised regimes and diets, and as a result of homogeneous environment have 
a smaller variability in measures. It seems probable that the present sample is, in 
the respects of diet and regime, more homogeneous than the population of the 
whole community. When we consider this, and the relatively small number of 
cases, these correlations, based on a seven-point rating scale, show a fair relation- 
ship between the ratings of the different judges ; and when the judges* ratings are 
combined into a single score, we have what we may consider a reliable criterion of 
body build against which to compare the objective indices. 
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TABLE III. 


Correlations between Judffes' Ratings of Chubbiness. 







Mo 






Mean 


1 

3 

7 

9 

10 

12 

15 

18 

24 

30 

36 

r 

JJ xD 

•77 

•73 

•76 

•79 

•79 

.. 

•81 

•79 

•71 

•74 

•73 

•76 

B xW 

— 

— 

— 

— 

— 

- 

•88 

•88 

•78 

— 

— 

•86 

WxD 


— 

— 

.. J 

1 

“ 

•77 

•79 

•78 

— 

— 

•78 


Self -Cor relations of combined Judges' Ratings. 


Months 

Moan 

1 & 3 

3 & 6 

6 & 7 

7 A 9 

9 & 12 

12 & 15 

15 & 18 

18 & 24 

•70 

•79 

•65 

•83 

•93 

1 

•85 

•86 

•86 

•81 


B. Growth trends of the Individual Bodily Dimensions. 

1 . Central Tendency. 

Table IV and Figs. 1 — 8 present the facts obtained in this seriatim study 
concerning the manner and relative rapidity of growth of the body -characters 
with which we deal in this paper. The curves of the means are all of the same 
general type, with total body length showing far more rapid acceleration than the 
other body dimensions. Stem length, chest circumference and head circumference 
arc similar in size and shape of curve throughout, though the latter shows smaller 
increments. Of the transverse measures, the shoulders are largest and grow the 
most. Next in size come hip width, chest width, and finally chest depth. Weight 
cannot be compared directly with these measures, but its curve is more nearly like 
that of height than any of the others. 

Figs. 4 — 7 present a comparison of this sample with other samples previously 
reported upon (4i). The data for the Growth Study, as already noted, are seriatim; 
all the other samples arc cross-sectional. As was to be expected in the light 
of Woodbury’s data, which showed Californian infants excelling those from other 
sections of the country in height and in weight, our sample, especially of the boys, 
is seen to average heavier and taller from the sixth month on than the samples 
from Iowa, New York City, and the United States at large. The Growth Study 
sample also averages somewhat taller and heavier than does the Californian sample 
above mentioned, except during the first six or eight months. During the first 
half-year the growth curves of all four groups are almost identical. 



TABLE IV. 

Table of Meana^ S*D,*e, P.E.*a of M earn and Coejfficienta of Variation of nine meaawrea 
made repeatedly on the aame Infanta from 1 through 36 moniha. 


(CentimetreB and kilograms.) 



■ 

■ 


Length 



Stem Length 



Head Gironmferenoe 

Monin 

■ 

No. 

M 

P.E.A, 

O’ 

V 

No. 

M 

P.E.JM 

a 

V 

No. 

M 

P.E.JW 

a 

V 

1 

Boys 

24 

65*45 

*317 

2*31 

4*16 

24 

36 00 

•168 

1*15 

3*18 

24 

38*56 

•164 

1*19 

3*09 


Girls 

26 

53*42 

*319 

2*41 

4*52 

26 

34*90 

*224 

1*69 

4*85 

26 

37*13 

*161 

1*22 

3*28 


Both 

50 

54*39 

*213 

2*23 

4*11 

60 

35*42 

*167 

1*66 

4*65 

50 

37*82 

•129 

1-36 

3*57 

2 

Boys 

Girls 

31 

58*86 

•222 

1*84 

3*12 

31 

38*64 

*267 

2*21 

6*71 

31 

40*11 

•137 

M3 

2*82 


27 

56*98 

*280 

2*16 

3*79 

27 

37*07 

*185 

1*43 

3*86 

27 

38*82 

*154 

1*19 

3*06 


Both 

58 

57*98 

*201 

2*27 

3*91 

58 

37*91 

*180 

2*03 

5*36 

58 

39*51 

•117 

1*32 

3*33 

3 

Boys 

31 

61*73 

*204 

1*68 

2*72 

31 

40*32 

*143 

1*18 

2*92 

31 

41*54 

•145 

1*20 

2*89 


Girls 

30 

59*72 

*276 

224 

3*76 

29 

38*98 

*187 

1*49 

3*83 

29 

40*20 

*133 

1*06 

2*63 


Both 

61 

60*74 

*193 

2*24 

3*69 

60 

39*67 

*128 

1*47 

3*70 

60 

40*89 

•118 

1*36 

3*30 

4 

Boys 

30 

64*11 

*227 

1*86 

2*88 

30 

41*82 

•130 

1*06 

2*52 

.30 

42*74 

*126 

1*03 

2*40 


Girls 

29 

62*37 

*224 

1*79 

2*88 

29 

40*85 

*215 

1*72 

4*20 

29 

41*40 

*142 

1*13 

2*73 


Both 

69 

63*25 

*184 

2*09 

3*31 

59 

41*34 

•136 

1*64 

3*73 

59 

42*08 

*114 

1*29 

3*07 

5 

Boys 

30 

66*63 

*263 

2*06 

3*09 

.30 

43*36 

*183 

1*49 

3-43 

30 

43*93 

*146 

1*18 

2*69 


Girls 

27 

64*67 

*307 

2*37 

3*67 

27 

41*81 

*246 

1*89 

4*52 

27 

42*46 

*157 

1*21 

2*85 


Both 

57 

65*60 

*218 

2*44 

3*72 

67 

42*62 

•172 

1*92 

4*62 

67 

43*23 

*124 

1*39 

3*21 

0 

Boys 

30 

68*48 

•266 

2*16 

.3*16 

30 

44*69 

*186 

1*61 

3*38 

30 

44*93 

*144 

117 

2*61 


Girls 

28 

66*38 

*272 

2*13 

3*21 

28 

4.3*18 

*187 

1*47 

3*40 

28 

43*28 

*148 

1*16 

2*67 


Both 

58 

67*46 

*219 

2*48 

3*67 

58 

43*91 

*146 

1*64 

3*73 

68 

44- 14 

*117 

1*32 

3*00 

7 

Boys 

26 

69*85 

•309 

2*34 

3*36 

26 

45*36 

•210 

1*69 

2*97 

27 

45*65 

*172 

1*33 

2*91 


Girls 

26 

68*04 

*357 

2*66 

3*89 

24 

44*19 

*281 

2*04 

4*62 

26 

44*.33 

*178 

1*32 

2*98 


Both 

61 

68*96 

•264 

2*69 

3*90 

60 

44*80 

•181 

1*90 

4*23 

62 

45*02 

*132 

1-41 

3*13 

8 

Boys 

26 

71*82 

*333 

2*47 

3*43 

25 

46*62 

*252 

1*87 

4*01 

25 

46*46 

*175 

1*29 

2*78 


Girls 

28 

69*68 

*355 

2*78 

3*99 

28 

46*11 

*236 

1*85 

4*10 

27 

44*98 

*167 

1*21 

2*69 


Both 

63 

70*69 

*263 

2*84 

4*01 

53 

45*78 

*179 

1*93 

4-21 

52 

45*69 

•138 

1-48 

3*24 

9 

Boys 

27 

72*89 

*297 

2*28 

.3*13 

27 

47*36 

*223 

1*71 

3*62 

27 

47*04 

*160 

1*16 

2*45 


Girls 

29 

70*34 

*314 

2*51 

3*66 

29 

46*43 

*200 

1*69 

3*61 

29 

45*36 

*162 

1*29 

2*85 


Both 

66 

71*67 

*244 

2*71 

3*79 

66 

46*36 

•173 

1*92 

4*14 

66 

46*16 

*141 

1*56 

3-39 

10 

Boys 

28 

74*28 

*287 

2*26 

3*03 

28 

47*90 

•218 

1*71 

3*56 

26 

47*66 

•168 

1*27 

2*67 


Girls 

27 

71*67 

*327 

2*52 

3*58 

27 

46*18 

•230^ 

1*78 

3*84 

22 

45*84 

•190 

1*32 

2-88 


Both 

65 

73*00 

*246 

2*70 

3*70 

55 

47*06 

*172 

1*89 

4*01 

48 

46-82 

*154 

1*58 

3*38 

11 

Boys 

27 

76*67 

•333 

2*57 

3*40 

27 

48*84 

*213 

1*64 

3*36 

25 

47*93 

•153 

114 

2*37 


Girls 

23 

73*23 

*388 

2*76 

3*77 

23 

46*94 

*272 

1*94 

4*12 

18 

46*63 

•242 

1*62 

3*27 


Both 

60 

74*49 

•281 

2*95 

3*96 

50 

47*97 

*188 

1*98 

4*11 

43 

47*34 

*159 

1*54 

3*26 

12 

Boys 

28 

76*89 

•323 

2*63 

3*30 

28 

49*16 

•242 

1-90 

3-86 

28 

48*50 

*166 

1*30 

2*68 


Girls 

28 

74*49 

*336 

2*63 

3*53 

28 

47*66 

•240 

1*88 

3*96 

27 

46-76 

*176 

1*36 

2*90 


Both 

56 

76*69 

•267 

2*86 

3*76 

56 

48*41 

*184 

2*04 

4*21 

66 

47*64 

•146 

1*60 

337 

15 

Boys 

27 

80*55 

•370 

2*86 

3*53 

27 

51*23 

*238 

1*83 

3-58 

27 

49*31 

*179 

1*38 

2*80 


Girls 

25 

77*66 

•372 

2*76 

3*56 

26 

49*00 

*242 

1*80 

3*67 

25 

47*62 

•148 

1*10 

2*32 


Both 

52 

79*16 

*296 

3*16 

3*99 

62 

50*16 

•197 

2*11 

4*20 

52 

48*46 

•143 

1*53 

316 

18 

Boys 

23 

83*18 

•470 

3*34 

4*02 

22 

62*09 

•266 

1*77 

3*40 

22 

49*82 

*187 

1*30 

2*62 


Girls 

24 

80*89 

•393 

2*86 

3*53 

24 

49*90 

*248 

1*80 

3*61 

23 

47*98 

*181 

1*28 

2*68 


Both 

47 

82*01 

•326 

3*31 

4*04 

46 

60*96 

*206 

206 

4*06 

45 

48*88 

•159 

1*58 

3*23 

24 

Boys 

24 

88*44 

*462 

3*28 

3*71 

24 

63*60 

*267 

1*87 

3*49 

24 

60*73 

•197 

1*43 

2*83 


Girls 

19 

86*08 

*313 

2*02 

2*.35 

19 

61*18 

*186 

1*20 

2-34 

18 

49*03 

*184 

1*16 

2*36 


Both 

43 

87*40 

•308 

2*99 

3*42 

43 

52*48 

•196 

1*91 

3*63 

42 

50*00 

*165 

1*59 

3*18 

80 

Boys 

25 

93*05 

*669 

4*14 

4*46 

25 

54*19 

*259 

1*92 

3-54 

25 

61*02 

•179 

1*33 

2*60 


Girls 

21 

90*71 

*468 

3*18 

3*51 

21 

52*66 

*295 

2*00 

3*80 

21 

49*26 

*180 

1*22 

2*48 


Both 

46 

91*98 

*391 

3*93 

4*28 

46 

53*49 

*211 

2*12 

3*96 

46 

60*22 

•149 

1-49 

2-97 

36 

Boys 

Gim 

25 

24 

97*38 

05*02 

*588 

*457 

4*36 

3*32 

4*48 

3*49 

26 

24 

66*68 

53*46 

•316 

•239 

2*34 

1*74 

4*21 

3*26 

23 

24 

61*30 

49*72 

•187 

*143 

1*83 

1*04 

2*59 

208 


Both 

40 

96*22 

*396 

4*11 

4*27 

49 

54*59 

*229 

238 

4*36 

47 

50*49 

•146 

4*49 

2*94 
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TABLE IV — (ooTttinued). 


(Centimetres and kilograms.) 


Month 


Shoulder Width 

Chest Width 

Chest Depth 


No. 

M 

P.E.Af 

a 

V 

No. 

M 

P.E.jkf 

or 

V 

No. 

M 

P.E.jf 

a 

V 

1 

Boys 

23 

14-86 

-091 

•65 

4-38 

22 

10-46 

•111 

•77 

7-38 

24 

9-89 

•083 

•60 

6-07 


Girls 

26 

1410 

•082 

•02 

4-38 

25 

10-16 

•101 

•75 

7-37 

26 

9-70 

•081 

•61 

6-33 


Both 

49 

14-46 

•068 

•71 

4-88 

47 

10*30 

•073 

•74 

7-21 

50 

9-79 

•059 

•62 

6-38 

2 

Boys 

31 

15-94 

•092 

•76 

4-75 

31 

1108 

•127 

1-05 

9-47 

31 

10-35 

•080 

•66 

6*4 


Girls 

27 

14-81 

•143 

1-10 

7-45 

26 

10-88 

•091 

•67 

G-16 

27 

10-21 

•090 

•69 

0-77 


Both 

58 

15-41 

•099 

1-12 

7-26 

56 

10-99 

•082 

•91 

8-28 

68 

10-28 

•063 

•71 

6-96 

3 

Boys 

30 

16-75 

•072 

•68 

3-47 

30 

11-97 

•139 

1-13 

9-44 

30 

10-67 

•082 

•67 

6-34 


Girls 

28 

16-12 

-082 

•64 

4-00 

28 

11-31 

•091 

•71 

6-29 

27 

10-18 

•088 

•67 

6-62 


Both 

58 

16-44 

•067 

•76 

4-60 

58 

11-65 

•090 

1-02 

8-72 

57 

10-38 

•066 

•74 

7-10 

4 

Boys 

30 

17-46 

•091 

•74 

4-25 

.30 

11-89 

•118 

•96 

8-08 

30 

10-68 

•074 

•60 

6-59 


Girls 

29 

17-10 

•111 

•89 

6-19 

29 

12- 12 

•108 

•86 

7-10 

29 

10*60 

•101 

•81 

7-64 


Both 

59 

17-28 

•076 

•87 

6-02 

59 

12-01 

•077 

•87 

7-28 

69 

10-64 

•063 

•72 

6-76 

6 

Boys 

30 

18-25 

•134 

1-09 

5-98 

30 

12-76 

•121 

•98 

7-68 

30 

11-01 

•069 

•48 

4-32 


Girls 

26 

17-83 

•151 

115 

6-42 

26 

12-36 

•076 

•68 

4-66 

26 

10-72 

•070 

•63 

4-93 


Both 

56 

18-05 

•106 

1-17 

6-49 

56 

12-67 

•078 

•87 

6-95 

56 

10-87 

•062 

•68 

5-29 

0 

Boys 

30 

18-98 

•117 

•95 

4-99 

30 

13-26 

•133 

1-08 

8-16 

29 

11-11 

•088 

•70 

6-29 


Girls 

28 

18-43 

•148 

1-16 

6-32 

28 

12-88 

•119 

•9.3 

7-26 

28 

10-87 

•077 

•61 

5-59 


Both 

58 

18-72 

•093 

1-06 

6-68 

58 

13-08 

•084 

•94 

7-21 

67 

10-99 

•061 

•68 

6-22 

7 

Boys 

27 

19-46 

•128 

•99 

5-08 

27 

13-67 

•102 

•79 

6-76 

26 

11-32 

•085 

•64 

6-66 


Girls 

24 

19-01 

•149 

1-08 

6-68 

24 

13-45 

•118 

•86 

6-39 

25 

10-90 

•096 

•71 

6-43 


Both 

51 

19-25 

•098 

1-04 

6-41 

61 

13-67 

•076 

•80 

6-90 

51 

11-16 

-061 

•64 

5-77 

8 

Boys 

25 

19-91 

•165 

1-23 

6-15 

26 

14-26 

•138 

1-02 

7-15 

25 

11-35 

•066 

•41 

3-64 


Girls 

26 

19-43 

•129 

•98 

5-02 

26 

1.3-81 

•116 

•87 

6-31 

26 

11-37 

•113 

•86 

7-62 


Both 

51 

19-66 

•111 

1-18 

6-98 

51 

1403 

•092 

•98 

6-97 

51 

11-36 

•064 

•68 

6-97 

0 

Boys 

27 

20-33 

•142 

1-09 

5-37 

27 

14-35 

•125 

•96 

6-69 

26 

11-67 

•082 

•62 

5-34 


Girls 

29 

19-64 

•144 

115 

6-84 

29 

14-06 

•116 

•92 

6-63 

29 

11-45 

•092 

•74 

6-42 


Both 

56 

19-97 

•108 

M9 

6-98 

66 

14-20 

; -086 

•96 

6-68 

55 

11-61 

•060 

•66 

6-71 

10 

Boys 

25 

20-74 

•151 

1-12 

! 5-39 

25 

14-76 

•126 

•94 

6-33 

26 

11-98 

*086 

•66 

5-41 


Girls 

22 

19-93 

•181 

1-26 

1 6-32 

22 

14-18 

•149 

1-04 

7-33 

22 

11-67 

•090 

•62 

5-34 


Both 

47 

20-36 

•124 

1-26 

618 

47 

14-49 

•100 

1-02 

701 

48 

11-84 

•043 

•44 

3-76 

11 

Boys 

24 

21-00 

•149 

1-08 

5-14 

24 

14-86 

•121 

•88 

6-92 

26 

11-96 

•082 

•62 

6-16 


Girls 

19 

20-25 

•199 

1-29 

6-37 

19 

14-73 

•127 

•82 

6-67 

19 

11-59 

•100 

•65 

5-69 


Both 

43 

20-67 

•126 

1-22 

6-91 

43 

14-80 

•090 

•88 

6-93 

45 

11-81 

•060 

•60 

606 

12 

Boys 

25 

21-09 

•141 

1-04 

4-94 

28 

16-10 

•120 

•94 

6-24 

28 

1217 

•088 

•69 

5-67 


Girls 

26 

20-47 

•121 

-91 

4-46 

27 

14-90 

•093 

•72 

4-83 

28 

11-88 

•064 

•50 

4*20 


Both 

51 

20-77 

•105 

111 

6-36 

55 

15-00 

1 -079 

•86 

6-76 

56 

12-02 

•060 

•67 

5-54 

15 

Boys 

27 

21-48 

•172 

1-33 1 

6-18 

27 

15-98 

•120 

•93 

5-79 

26 

12-29 

•086 

•66 

5-28 


Girls 

25 

20-71 

•124 

•92 

4-42 

25 

15-28 

•089 

•66 

4-31 

25 

12-11 

•083 

•62 

609 


Both 

52 

21-11 

•113 

1-21 

6-73 

62 

15-64 

•085 

•91 

5-82 

51 

12-20 

•062 

•66 

! 6-38 

18 

Boys 

22 

22-18 

•154 

1-07 

4-81 

22 

16-06 

•123 

•86 

5-34 

22 

12-48 

•124 

•86 

690 


Girls 

22 

21-40 

•124 

•86 

4-03 

22 

16-63 

•116 

•81 

6-19 

23 

12-35 

•093 

•66 

5-32 • 


Both 

44 

21-79 

•107 

105 

4-80 

44 

16-79 

•093 

•92 

5-81 

45 

12-42 

•072 

•71 

5-73 

24 

Boys 

23 

22-59 

•163 

115 

5-11 

22 

16-45 

•144 

100 

6-10 

23 

12-77 

•116 

•82 

6-41 


Girls 

18 

21-99 

•165 

104 

4-73 

19 

15-92 

•105 

•68 i 

4-24 

19 i 

12-17 

•114 

•74 

605 


Both 

41 

22-33 

•117 

Ml 

4-97 

41 

16-20 

•099 

•95; 

5-84 

42 j 

12-60 

•086 

•83 

6-62 

30 

Boys 

25 

22-51 

•218 

1-62 

7-18 

24 

16-96 

•115 

•83 

4-90 

26 

12-68 

•148 

110 

8-66 


Girls 

21 

22-20 

•130 

•88 

3-97 

21 

16-26 

•088 

•60 

3-69 

21 

12-21 

•128 

•87 

7-12 


Both 

46 

22-37 

•132 

1-33 

5-94 

45 

16-63 

•085 

•86 

508 

46 

12-46 

•107 

107 

8*59 

36 

Boys 

25 

23-54 

•132 

•98 

4-16 

23 

17-20 

•122 

•87 

505 

23 

13-00 

•127 

•90 

6-93 


Girls 

24 

23-08 

•123 

•89 

3-87 

24 

16-65 

•096 

•69 

419 

24 

12*42 

•083 

•61 

4-87 


Both 

49 

23-32 

•089 

•92 

3-94 

47 

16-87 

•082 

•33 

4-93 I 

1 

47 

12*70 

•084 

•85 

6-68 




Nantot Batlbt aho Fbank C. Datis 
TABLE IV— (oon<»nu0dX 

(Oentimetrea and kilograma.) 
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CeriTimcTers 



Curves of Growth in Seven Body Dimensions (Sexes Combined). 
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TABLE V. 

Birth Measurements, 


Measure 

Sex 

No. 

Mean 

Median 

S.D. 

Range 

Head Circumference 

Bovs 

28 

35-52 

35-40 

1-1499 

33-3 to 39-0 


Girls 

25 

34-52 

34-50 

1-3778 

31-3 to 37-0 


Both 

53 

35-05 

3600 

1-8801 

31-3 to 39-0 

Chest Circumference 

Boys 

28 

33-60 

33-45 

1-3919 

31-0 to 36-3 


Girls 

24 

33-51 

33-75 

2- 1703 

29-5 to 38-5 


Both 

52 

33-50 

33-55 

1-8365 

29-5 to 38-5 

Transverse Head 

Boys 

28 

9-53 

9-50 

-6559 

8-6 to 10-4 

JMamoter 

(rirla 

25 

9-23 

9-20 

-5709 

8-2 to 10-4 


Both 

53 

9-39 

9-40 

-4884 

8-2 to 10-4 

Anterior-Posterior Head 

Boys 

28 

12-28 

12-30 

-5283 

11-3 to 13-4 

Diameter 

Girls 

25 

11-90 

12-00 

-6242 

10-4 to 13-3 


Both 

53 

12-10 

12-10 

-6131 

10-4 to 13-4 

Weight (lb.) 

Boys 

31 

8-17 

8-19 

-6971 

6-75 to 9-63 


Girls 

30 

7-37 

7-19 

1-2828 

5-38 to 10-56 


Both 

61 

I 7-78 

7-88 

1-1024 

5-38 to 10-56 

Weight (kg.) 

Boys 

31 

1 

3-70 

3-71 

-32 

3-06 to 4-36 


Girls 

30 

3-35 

3-27 

•58 

2-44 to 4-79 


Both 

61 

3-52 

3-58 

•50 

2-44 to 4-79 


A comparison with the measures of the Iowa Child Welfare Research Station 
<26) is especially interesting, since the same measurements were made, and at the 
Siime ages, in the two studies. Also, the number of cases for each age is approxi- 
mately the same for both groups. The Iowa data contain many remeasures of the 
same children, but since the samples were not limited to children on whom seriatim 
measures were made, the samples for the different ages arc essentially cross- 
sectional. The growth curves for all measures are closely similar for the two 
groups, except that for reasons already considered, the Growth Study curves are 
comparatively very smooth. 

There is very little difference in the size of these two groups, except that the 
California babies usually measure slightly larger after six months. The greatest 
difference is in weight. 

The mean weight and height of approximately 200 Berkeley infants (a repre- 
sentative sample) for months 12, 21 and 24 are almost identical with those of the 
smaller group of Berkeley infants we have been considering. The mean weights 
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of the larger group of infants are 10*62, 12-30 and 13*11 kg. for 12, 21 and 24 
months, respectively. For these same ages their mean lengths are 76*18, 84*84 
and 87*88 cm. Whatever selective factors are operating to make the Growth Study 
infants taller and heavier than infants in other parts of the country, are also 
effective for the population of Berkeley as a whole. 

When the growth trends of the various dimensions are made more directly 
comparable with each other by converting each measure into the percentage of 
itself at one month (Figs. 8 — 11), the measure which far outstrips all others is 
weight. This, of course, is to be expected, as it is a volumetric measure. 

An inspection of the proportional growth curves of the linear measures reveals 
very interesting differences in the rates of increment of the different dimensions. 
Throughout the first 18 months (when growth in weight is most rapid) the 
measures which show the greatest increase are width measures — hips, shoulders, 
and chest — and it is not until the 24th-month records that we find the increments 
for height greater than those for the three widths. As for the height measures, 
the first year shows practically equal proportionate growth for the stem length and 
for the total length. It is only after the 1 2th month that the legs begin to grow 
rapidly and cause a relatively greater increase in the total height curve. The 
dimension which shows the smallest percentage increment throughout the entire 
36 months, is chest depth. Head circumference shows the second smallest 
increment. 

In general, though not consistently throughout, the cephalo>caudal direction 
of growth is followed in the percentage growth curves. The head size is relatively 
more mature at birth than are other measures and its postnatal growth is small ; 
in the early months growth in shoulder width is relatively more rapid than chest 
width, and this position is reversed later ; stem and leg length grow at about the 
same rate at first, but after the first 18 months, leg length is the most rapidly 
increasing dimension measured. 

The changing body proportions, then, are for the first year due mainly to 
relatively greater growth in transverse widths, and relatively small growth in head 
size ; while for the second and third years, the decrease in growth rate of all 
dimensions except total height results in changed body proportions due to rapidly 
increasing leg length. We find, especially during the first year, not only changes 
in the relative length of body parts, but also in the relative widths ; and indices 
of body build should be formulated with this fact in mind. 

C. Variability. 

When we consider the variation from the mean curves of growth, the coefficients 
of variation (Table IV, pp. 37 — 39) show no age changes for any of the measures 
except two. Chest width and hip width decrease slightly but consistently in 
variation, during the three-year period. 
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Figs. 6 — 11. Percentages of First Month’s Measures achieved at liater Ages. 
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There are no mgnificant sex differences in variation. Although the coefficients 
of the girls axe slightly larger in ninety-three comparisons, while the boys are 
larger in only fifty-seven comparisons, none of the differences is significant. The 
only measure in which the boys F’s are greater fof most ages is chest width, and 
since this one measure shows a contrary tendency to all the others in this 
respect, it may possibly be indicative of a real sex difference. There is also a 
slight suggestion that the girls are possibly becoming more homogeneous in length 
and in chest depth, toward the end of this three-year period. 

Montague and Hollingworth (88) found no tendency for sex differences in 
variability for seven measurements made on 2000 newborn infants. The measure- 
ments of the Iowa Child Welfare Research Station ( 86 , 87 ) from one month through 
six years show no consistent sex differences in variation, with the possible ex- 
ceptions of weight and hip width in the later ages. Both these studies do show, 
however, consistent tendencies for the boys to be larger. It seems unlikely, then, 
that any slight differences in the coefficient of variation found in the present 
group represent sex differences. The group is, however, typical of children, in 
general, so far as sex differences in sisse are concerned. 

There are only slight tendencies for the standard deviations to increase with 
age, with the exception of weight, the standard deviation of which increases from 
*54 kg. at month 1 to 1-72 kg. at month 86. 

D. Consistency of Growth. 

When we select, from the 36-month measures, the tallest boy and girl and the 
shortest boy and girl, and plot their individual curves of growth in length (Fig. 12), 
we find that the tall children have been consistently tall from the first month, and 
the short children consistently short. The extent to which this is true for the 
entire group in length and in other dimensions may be shown by correlation 
coefficients between different ages for the same dimension. Tables VI — XV give 
the correlations between the measures of these children at different ages. Head 
circumference, length and stem length are most consistent: the correlations 
between the 1-month and 36-month measures are, respectively, *72, '68 and *59. 
Weight comes fourth, with correlations which are considerably lower, the r for 
the same interval being only *40. There is, however, a marked increase in the 
consistency of this measure after the first year. The width measures of hip and 
shoulder diameters and chest circumference, equally low in consistency, all show 
the same phenomenon as does weight — a decidedly greater consistency after the 
first year. Chest width and depth, relatively unreliable measures, show low con- 
sistency and only small increase in consistency with growth. 

The width measures become more consistent after their period of relatively 
rapid growth has stopped, and weight, which is largely infiuenced by the width 
measures, exhibits the same trend. 
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Self -Correlations showing Consistency of Growth. 


TABLE VI. 


In Length. 


Month 

3 

6 

12 

IS 

24 

30 

36 

1 

•87 

•77 



•68 


•68 

3 

— 

•84 

— 

— 

•71 

— 

•67 

« 


— 

•91 


•83 

— ! 

•79 

12 

— 

— 

— 

•91 

•93 

— 

•88 

18 

— 

— 


I 

•92 

— 

•89 

24 

— 

— 

, — . 



— 

•96 

•92 

30 

— 

— 

— 

' — 

— 

— 

•97 


TABLE VII. 


In Weight. 


Month 

3 

— 

6 

12 

18 

24 

30 

36 

1 

•82 

•54 



•48 


•40 

3 


•80 

— 

— 

•62 

— 

•48 

6 

— 

— 

•72 

— ! 

•66 

— 

•61 

12 

— 

— 

— 

•92 

•91 

— 

•80 

18 

— 

— 

— 

— 

•93 

— 

'88 

24 

— 

— 

— 

— 

— 

•94 

•95 

30 

1 

1 

— 


— 

— 

— 

•96 
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TABLE VIII. 


In Head Oireum/erence. 


Month 

6 

12 

24 

36 

1 

•79 

*71 

•74 

•72 

6 

— 

*86 

gill 

*81 

12 

— 

— 

*91 

*93 

24 

— 

— 

— 

•92 


TABLE IX. 


In Stem Length. 


Month 

6 

12 

24 

36 

1 

•66 

•63 

•45 

*59 

6 

— 

•79 

*71 

•68 

12 

— 

— 

*86 

*74 

24 

— 

— 

— 

•76 


TABLE X. 


In Chest Circumference. 


Month 

6 

12 

24 

36 

1 

•25 

•29 

*26 

•29 

6 

— 

•44 

*41 

•43 

12 

— 

— 

*73 

*69 

24 

j 

— 

— 

•82 


TABLE XI. 


In Chest Width. 


Month 

6 

12 

24 

36 

1 

•14 

•19 

-•01 

•23 

6 

— 

•63 

•37 

*20 

12 

— 

— 

•64 

•61 

24 

— 

— 

— 

•71 

j 


TABLE XIII. 

In Shoulder Width, 


Month 

6 

12 

24 

36 

1 

•10 

•33 

•13 

•21 

6 

— 

•46 

•33 

-.36 

12 

— 

— 

•68 

•66 

24 

— 

— 

— 

•71 


TABLE XII. 


In Chest Depth. 


Month 

6 

12 

24 

36 

1 

•67 

*40 

•26 

•12 

6 

— 

•63 

•61 

•42 

12 

— 

— 

•46 


24 

— 

— 

— 

*63 


TABLE XIV. 
In Hip Width, 


Month 

6 

12 

24 

36 

1 

•46 

•37 

•32 

•23 

6 

— 

*48 

*60 

•45 

12 

— 

— 

•80 

•83 

24 

— 

— 

— 

•83 


TABLE XV. 


Self -‘Correlations showing Consistency according to Sex. 


Measure 

- . i 

Month 

6 A 12 

12 A 24 

24 & 36 

Boys 

Girls 

Total 

Boys 

Girls 

Total 

Boys 

Girls 

Total 

1 

L^igth 

•89 

•86 

•91 

•94 

•88 

*93 

*93 

•87 

*92 

Weight 

*66 

•78 

•72 

*89 

•97 

*91 

*95 

*92 

*96 

HipWidth 

•33 

•57 

•48 

•86 

•70 

*80 

*84 

•84 

•83 


*53 

•51 

•44 

•46 

*68 

*74 

*76 

•77 

•73 
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l^’he above correlations have all been computed for the total group, regardless 
of sex. The division into sex groups seemed unnecessary, since there were no 
significant differences in the growth tendencies of the two sexes. As a further 
check, however, consistency correlations were computed on the sexes separately 
for three measures, weight, length, and hip width, and the W./L.* index, for three 
age intervals (Table XV). These correlations give no evidence of sex differences in 
consistency. 


E. Interrelationships among the Measures. 

Inter-Correlations between Individual Dimensions. 

The correlations between the measures have been computed for representative 
ages. Table X VI gives these correlations, ranked from highest to lowest, in the 
order of the mean of the r's between each pair. There is a tendency for the higher 
inter-correlations to be between weight and one of the more reliable width mea- 
sures; and between the two length measures (which rank second and fourth in 
reliability). The three combinations which correlate most highly arc ; hip width 
and weight, with a moan r of *84 ; stem length and total length, with a mean r 
of *82 ; and chest circumference and weight, with a mean r of *80. We find, also, 
fairly high relationships between stem length and weight, and total length and 
weight (*74). It is to be expected that weight should correlate highly with the 
various diameters and circumferences, since weight is a composite which includes 
them all, and since they are all positively correlated with each other. 

TABLE XVI. 


Inter-Correlations of Measures. 







Month 





Mean 












1 

3 

6 

9 

12 

15 

18 

24 

36 


H.W. X W. 

•s<m 

•807 

•816 

•791 

•821 

•852 

•875 

•812 

•864 

•836 

L. X 8.L. 

•742 

•784 

•810 

•844 

•879 

•882 

•797 

•855 

•791 

•820 

ac. xVV. 

•746 

•849 

•791 

•844 

•792 

•851 

•8.57 

•698 

•770 

•796 

S.L. xW. 

•72» 

•786 

•685 

•736 

•782 

•804 

•707 

•730 

•732 

•743 

h. X W.* 

•818 

•708 

•651 

•734 

•777 

•795 

•734 

•808 

•813 

•739 

S.W. xW. 

•671 

•748 

•718 

•718 

•670 

•696 

•663 

•707 

•667 

•604 

O.C. xH.W. 

•704 

•711 

•625 

•706 

•641 

•732 

•817 

•652 

•634 

•691 

S.W. xC.(^ 

•479 

•607 

•712 

•691 

•628 

•7tK) 

•613 

•604 

•647 

•641 

S.W. xH.W. 

•624 

•586 

•467 

•613 

•469 

•635 

•697 

•611 

•647 

•694 

L. x8.W. 

•587 

•670 

•642 

•562 

•536 

•570 

•571 

•658 

•496 

•688 

L. X H.W. 

•598 

•489 

•336 

•491 

•649 

•655 

•636 

•666 

•732 

•584 

H.W. x S.L. 

•644 

•579 

-.393 

•402 

•635 

•716 

•668 

•584 

•601 

•580 

0.0. xH.L. 

•557 

•627 

•493 

•607 

•582 

•678 

•632 

•602 

•637 

•579 

L. X O.C. 

•498 

•546 

•538 

•648 

•595 

•634 

•629 

•669 

•560 

•672 

S.W. xS.L. 

•453 

•587 

•496 

•496 

•544 

•628 

•491 

•513 

•633 

j 

•527 


^ Additional oorrelations between length and weight have been computed. They are for months 
2, 4, 6, 7, 8, 11 and 80 and are, in order, *762, *644, *658, *707, *727, *768, and -719. These have been 
included in computing the mean of the length-weight correlations. 
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The r's of *69 between chest circumference and hip width, and of ‘64 between 
chest circumference and shoulder width, indicate a fair amount of symmetiy in the 
transverse trunk measures, but by no means perfect symmetry. The length-width 
correlations are the lowest ones and fall in the last six places in the rank order 
table. These r's range from *53 to ‘69. One should expect, then, to find in this 
group tall slender infants and short chubby ones, as well as the more frequently 
occurring large, medium and small sizes of symmetrical proportions. In general, 
there is a clear tendency for all measures to be positively related. Large children 
are for the most part large in all their measures, and small children small in all 
their measures. 

As for age trends in these inter-correlations, there is a tendency for some 
of the coefficients to decrease at and near six months. This is true of the cor- 
relations between weight and length, hip width and length, and between hip width 



and stem length. The same tendency is present — ^but less clear-cut — in the cor- 
relations between shoulder width and hip width. Some other coefficients tend 
to decrease slightly after month 18. Among these are the correlations between 
weight and chest circumference, length and chest circumference, hip width and 
chest circumference, and between length and stem length. 

Boas and Wissler(i2) contend that correlations between measures are high 
during periods of rapid growth, and conclude from this that when rapid growth 
occurs in one part of the body, it occurs also in other parts. Berkson ( 9 ) has com- 
puted the inter- correlations between height, weight and chest circumference for a 
large number of cases, and concludes that the alternating points of high and low 
correlations which he finds correspond with critical periods of growth. His data 
for the first six years are taken from Woodbury ( 41 ), and this part of his curve 
of length-weight correlations is given in Fig. 13 with the correlations obtained 
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for the present group. The Growth Study curve for length-weight correlations 
shows high r's at months 1 and 15, with a consistent drop from 1 to 4, and a rise 
from 7 to 15, followed by variations showing no directional trend between 15 and 
36 months. These r& difiFer from the Woodbury data in being high at month 1, 
with a directional trend of decreasing rs from months 1 to 4^, and no significant 
trend toward decrement after month 15. 

A mimber of other investigators have computed r*s between body measures, 
and their findings may be compared with ours. Taylor’s <36) correlations between 
weight and length of 240 newborn infants are '79 for boys and *81 for girls — very 
close to our one-month correlation of *82. 

The correlations between weight and length given by Gray and Ayres for year 
groups, starting at one year, f^^il, as ours do, to show a decreased r at three years. 
Instead there is a rise from one to three followed by a drop at five years. 

Baldwin (6) computed intcu -corrclations for about 120 children at each year from 
seven to seventeen years. He found that the correlations between different body 
measures werti higher for the boys than for the girls, and that they were higher 
for both sexes in early adolescence, and lowest at 17 years. His length- weight 
correlations show trends very similar to those of Berkson. As has been already 
mentioned. Boas and Wissler(i2) also find increased correlations during rapid pre- 
adolescent growth. Wallis ( 39 ) gives correlations between pelvic width and length 
of a group of New York boys from three to eight years. These are plotted in 
Fig. 14 with the length -hip width rs for the present group. Although the length- 
pelvic width r s of Wallis are smaller at throe years than our length-trochanter 
width r’s, they show the same directional trend of increasing correlation with age 
that is indicated in our sample after month 6. 

The Woodbury measures, used by Berkson, though for large samples, were 
collected from many sources with the possibility of great differences in technique and 
reliability of the measures. In addition, it was necessary to compute the correlations 
from class groups in Woodbury’s tables instead of from the original data. Since 
the measures of the Growth Study have been made by a single observer with a 
consistent method, and since the same children have been measured at the successive 
age levels, and since none of the other correlations in the studies cited show 
decreased r’s at three years, this curve may well give the truer approximation of 
the changes in the body relations which occur during growth in the individual 
through the first three years. 

So far, then, as the evidence for this short age period is concerned, any age 
differences in the inter-correlations of the body measures seem to be related to 
differences in the relative growth rates of the parts measured, and perhaps to 
individual differences in tendency to increased chubbiness during the first nine or 

* There ie Bome evidence for a similar trend in the Woodbury data, but in smoothing his curve 
Berkson does not accept this. 
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eleven months. Length-width correlations tend to drop from months 1 to 6 
when growth in widths is more rapid and more variable among individuals^ and then 
to rise again as the disparity in growth rates of the body parts involved diminishes. 
After eighteen months, the growth in chest circumference is very slow, and the 
correlations between it and the still rapidly growing measures — weight, length, hip 
width and stem length — tend to drop. For this sample, during the first three years, 
growth changes in correlation between the various body parts show more relation 
to the amount of disparity between rates of growth in these parts, than to successive 
periods of rapid and slow growth of the body as a whole. The situation may be 



very different at later ages when body proportions are changing less radically. But 
it is also possible that the high adolescent correlations between body measures are 
due to the wider variation of size at this time when some children start rapid 
growth at a younger age than do others. Boas (lO) has found that, although there is 
greater variability in the size of adolescents than in older or younger children, 
‘"the maximum of variability of stature during adolescence disappears in groups 
having the same moments of maximum rate of growth.*’ If the correlations were 
computed between measures on children of the same age of maximum rate of 
growth, rather than grouping together all children of the same age, the high 
adolescent r*s would probably also disappear. 
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F. Body-build. 

The recognised desirability of a quantitative measure of body-build which 
should reliably and adequately represent the empirically observed variations in 
human structure has been productive of much debate, and of a considerable amount 
of research as well, from the time of Quetelet up to the present. The evidence recently 
brought forward by Kretschmer, pointing to a close relationship between bodily 
proportions and tempemmental predispositions, lends added significance to the 
the search for an objective index of ))uild. Among the indices which have been 
projx)sed are the ratios of weight to height, weight to the square of height 
(Quetelet, Davenport), weight to the two and one-half power of height (Quetelet), 
weight to the cube of height (Rohrer and Bardeen), chest circumference to height 
(Davenport), hip width to height (Pryor and Lucas, Gray), stem length to height 
(Gray, Hejinian and Hatt), and many others. Objections have been raised to the 
use of an index which includes weight as one of the factors (Wertheimer and 
Hesketh ( 40 )) on the ground that weight fluctuations may be of serious extent, even 
over a relatively small span of time, as a result of a variety of exogenous influences. 
This objection, however, while it may be of moment with subjects such as those 
used by Wertheimer and Hesketh (adult psychotics), may nevertheless be relatively 
inconsequential with subjects who are free from any serious ailment, either physical 
or behaviouristic. The self-correlations in Table I (p. 34 above) indic^ite that this 
objection is not important in the present sample. With infants and with children, 
at least, indices involving weight as one factor should not be dismissed on a pHoH 
grounds, because weight is one measure in the recording of which the personal 
equation plays a I'elatively minor r61c. We shall see that the superior reliability 
of measurements of body weight, with infant and child subjects such as those used 
in the present investigation, gives indices which are based in part upon weight 
measurements a definiti^ advantage over various other proposed indices which 
incorporate measurements difficult to obtain, at these early ages, with a sufficiently 
high degree of accuracy. 

Several diftbrent indices have been computed from the measures taken on the 
Growth Study children, and their means are given in Table XVII (p. 56) and 
Figs. 15 — 22. These curves exhibit interesting age trends in body proportions. 
As would be expected from the proportionate growth curves in Fig. 9, the relative 
hip width (Fig. 15) increases through the sixth month, but declines again, rapidly, 
after the seventh month. If we eliminate the leg measures by comparing hip width 
to stem length (Fig. 10) the initial rapid increase of hip width is still present, but 
after a short decrease between the seventh and ninth months, the relationship 
between the two measures remains constant. The ratio of stem length to total 
length, after a slow decrease through the ninth month, falls off very rapidly after 
this age. The same may be said of the ratios of chest circumference to length, and 
of chest circumference to stem length. 

The length-weight indices vary greatly, according to the power of length used 
in computing the index. The simple W./L. ratio shows a growth curve of increasing 
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weight in relation to length, rapid at first, but slowing up in the second and third 
year, W./L.* — a ratio which has been recommended on the ground that weight is 
a volumetric measure — shows a decreasing index which is rapid after the ninth 
month. W./L.* — an index which has often been used — shows relatively increasing 
weight through the eleventh month, with a more gradual decrease after this age*. 
All three of these indices yield results on the present sample which are closely 
similar to age curves of the mean indices of other groups presented elsewhere. 
Davenport ( 14 ) gives, in tabular form, age changes in mean indices for these three f 
which are very similar to ours, and our W./L.* means are almost identical with 
those for the first three years derived from Quetelet’s data ( 84 ). 


[* At mj saggeBtion the authors worked out the value of J^=£W./(0.0. x H.W. x L.) as an index. 
It will be seen that this is an attempt to obtain weight divided by a product of three linear quantities, 
i.e. chest circumference, hip width and length. I did not suggest chest depth, as the authors considered 
it a not very reliable measure. Thus roughly we have an index of weight per volume of the body, or we 
have an index which seemed to me the nearest approach to what 1 should call chubbiness — a term 
not defined by the authors. Dr Bayley writes to me; “Chubbiness*’ as we used it in our ratings of 
photographs meant the tendency to be wide, relative to length, rather than to have excess fatty tissue, 
though this latter factor had a strong influence on the rating.” (See below, p. 71.) The authors obtained 
the following results : 


Index W.fiC.C, x U.W. x L.) 




Age in months 



1 

5 

12 

18 

36 

Index 

197 

181 

181 

172 

i 

170 


When these values are plotted it will be seen that the index decreased slowly with age, but in an almost 
linear manner. 2,1 + *6431 x (age in months) is a function which on the average is nearly constant, 
= 189*46, for the first thirty-six months of life. It thus appears to be more satisfactory than several of 
those graphed in Figs. 15 — 22, the most nearly linear of which is probably W./l*, an index which we 
might anticipate would have a fairly high correlation with the present index. was found, however, to 
have no correlation with the judges’ ratings of ** chubbiness,” nor with the index on which the 

writers lay stress. Now this latter index may be read as W./Z^ x Z, and this seems to indicate that a 
volumetric index of chubbiness should have no relation to length or absolute sixe of the infant, which 
I take it should be true for a real definition of “ chubbiness.” That this ^Mndex of chubbiness” has no 
relation to the judges’ ratings of ** chubbiness,” 1 should, 1 am afraid, be inclined to account for by the 
fact that the judges have unconsciously paid considerable attention to size (Z) in their estimate of 
“ chubbiness,” which accounts for the correlation between their ratings and the index W./Z^x Z = W./Z‘^. 
Anyhow, if chubbiness equals plumpness, I think it should be practically independent of size, and that 
a reasonable scientific definition of it would be weight per unit of volume. Short of actually measuring 
the volume of the body by immersion — practically impossible with infants — we have to find rough 
measures of the volume of the body, or take a product of three linear dimensions which may reasonably 
be supposed proportional to it. I may have a different opinion from the authors of what is meant by 
** chubbiness,” but 1 think at any rate we need a scientific definition of the term. I personally am 
inclined to identify it with average bodily density, and thus to free it from any a priori relation to size ; 
we may afterwards show that, as is the case with the present index, density appears to decrease 
physiologically with age. For children of the same age, I should anticipate little relation of density to 
size. If there be, we may be running up against some other very important physiological fact. Ed.] 
t These tables are based on tables given by Bardeen (7), using the data of Sohmid-Mounard and of 
Quetelet. 



TABLE XVn. 

Table of Means for Boys, Girls, and Total Group of Eight Body-build Indices from 1 through 36 months. 
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Most of these indices show rapid growth changes in body proportions, vaiying 
with the rates of growth of the parts compared. They are in accordance with our 
other findings, that early postnatal growth is most rapid in the transverse body 
dimensions, while after the first year leg length (and presumably arm length, which 
was not measured in this study) increases far more rapidly than any other body 
segment. 

Any sox differences to be found in these body proportions are, again, very small, 
and with a sample of the size of the present study, certainly not significant. How- 
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Figs. 15 — 21. Curves of Growth in Seven Body>build Indices. 
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ever, so far as this group goes, duriog the first six months there is a tendency for 
the girls to have wider hips in proportion to total length and stem length (the latter 
is less clear); while after six months the situation is reversed, and the boys have 
relatively larger hips. The boys usually have a relatively longer stem length, when 
compared to stature, and after month 16 they have a greater chest circumference. 
When chest circumference is compared to stem length, however, the circumferences 
of the girls are larger. All of the weight-length indices show that, for the most 
part, the boys are heavier relative to their length. 

Wallis (89) has computed recumbent stem length/length indices for New York 
private school children aged 1 to 8 years, and Merrill- Palmer school children, aged 
2 to 6 years. Her indices, though in general their trends are similar to ours, 
decrease with age much less abruptly at the ages in which the Growth Study group 
was measured in the upright position (see Fig. 23, p. 60). The New York group 
shows the same sex difference as ours (higher indices for the boys) with the 
exception of years 7 and 8. 

Davenport (16), following an extensive investigation of the statistical behaviour 
of the several indices cited above which are based upon weight and some power of 
height, concluded that either W./H.* or W./H.** was to be preferred as an index of 
body build to either W./H. or the W./H.® index which Bardeen (7) had argued for 
on logical rather than empirical grounds. The two former indices yield smaller 
measures of dispersion than do the two latter, with a given group of subjects at a 
particular age, and they also fit better the empirical picture of developing body 
build. 

Table XVIII (p. 60) adduces new evidence in support of the W./H.* index. (Since 
Quetelet's data showed both W./H.* and W./H.** to be good^ indices, with little to 
choose between them, the latter index was not computed for the data from the present 

[* The authors assume here that a '*good’’ index is one that correlates highly with the judges* 
ratings of ** chubbiness.** I should anticipate the reverse of this: see my footnote, p. 56. Ed.] 
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Oomparative Curves of the Stem Length : Length Index. (The New York and Merrill-Palmor Curves are 
based on Recumbent Measures (S9). Growth Study Measures are recumbent through month 18.) 


TABLE XVIIL 


Correlations of Indices of Body-build with Judges' Ratings. 


Index 

Months 

Mean 

r 

1 

3 

5 

7 

9 & 10 

12 

16 

18 

24 

30 

36 

W./L.2 

•75 

•71 

•83 

•75 

•84 

•79 

•77 

•72 

•66 

•84 

•79 

•76 

W./L.» 

■69 

•08 

•79 

•79 

•84 

•74 

•70 

•63 

•59 

•71 

•68 

•71 

W./L. 

•06 

•60 

•72 

•59 

•00 

•73 

•62 

•74 

•69 

•71 

•66 

•67 

H.W./L. 

•65 

•05 

•75 

•71 

•58 

•46 

•65 

•67 

•54 

•67 

•63 

•62 

H.W./S.L. 

•54 

•02 

•09 

•63 

•51 

•42 

•66 

•68 

•65 

— 

— 

•68 

L. X KKK) 













C.I>. X C.W. X S.L . 

— 

— 

1 

— 

— ' 

— 

•49 

•57 

— 

— 

— 

•53 

ac./L. 

•38 

•47 

•58 

•63 

•61 

•53 

•60 

•47 

•10 

•46 

•43 

•47 

C.C./S.L. 

•15 

•32 

•50 

•59 

•66 

•48 

•41 

•64 

•13 

— 

— 

•41 

S.L./L. 

-•22 


•28 

— 

— 

•19 

— 

-06 

-06 

— 

— 

•03 


sample.) For the first time, we believe, quantitative measures of body-build have 
been computed from seriatim data gathered from an approximately constant sample 
during the first throe years of life, and have been compared with ratings of body- 
build based upon photographs taken on the same days on which the measurements 
were made. The ratings, as has been explained, were given independently by two 
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judges (B and D) for the eleven ages at which photographs were taken. A third 
judge (W) made ratings for months 15, 18 and 24. 

For correlation with the several body-build indices the ratings of the judges 
were pooled. Table XVIII shows W./L.® correlating higher with the pooled photo- 
graph ratings, at almost every age, than any of the other indices which were 
similarly treated. The relative hip width and the relative chest girth ratios 
(especially the latter) appear as poor measures of body-build when correlated 
against the criterion furnished by the judges* ratings. An adaptation* of the Wert- 
heimer-Hesketh index (leg length x 1000/C.D. x C.W. x S.L.) was correlated with 
the photograph ratings for two ages only, 15 and 18 months, and yielded coefficients 
of *49 and *67 respectively. This index affoi-ds an example of an index the validity 
of which suffers because of the relatively low reliability of some of the measures 
entering into it. Though this objection probably holds for subjects of ages similar 
to those used in the present investigation, it does not necessarily hold for more 
mature subjects. This is indicated by the high correlations reported by Wertheimer 
and Hesketh betweeu their index extremes and the psychotic types with which, if 
Kretschmer’s conclusions were soundly based, asthenic and pyknic body builds 
should be correlated. 

The sharp drops found in the majority of the r’s in Table XVIII at the twenty- 
fourth month are probably to be attributed to a marked shrinkage in the number 
of cases available for measurement and photographing on that occasion, due to 
the illness of a number of subjects, and also to the necessity for eliminating the 
records of several other subjects who proved unusually refractory on their second 
birthday. If this one patently atypical correlation be omitted from consideration, 
the average correlation of the W./L.® indices with the judges* ratings, for ten ages 
between the limits of one month and three years, is *78. Including the twenty- 
fourth month, the mean of the r*s is *76. 

Table I (p. 34) gives the self-correlations of the W./L.® index, for one-month 
intervals throughout the first year, for three-month intervals from 12 to 18 months, 
and for six-month intervals from 18 to 86 months^ Five out of the sixteen coefficients 
are less than *80, but from the seventh month through the thirty-sixth (excluding the 
twenty-four month correlations as atypical, for the reasons given above) these self- 
correlations average *87. Including all ages, they average *82. Considering the 
rapid growth changes, especially in the early months, these correlations are probably 
minimum measures of reliability ; and remeasure-coefficients derived from measure- 
ments and remeasurements made on the same day would doubtless be found to be 
higher. 

Fig. 22 (p. 59) presents a curve for the development of body-build, using W./L.® 
as the best quantitative measure of body-build for the ages under consideration, for 
the sexes separately, and for the total group. Since a low index means relative 
slenderness and a high index indicates relative stockiness, it is seen that these 

* This adaptation was neoesaary, since we did not make exactly the same measures as did 
Wertheimer aud Hesketh (40). 
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curves make manifest a trend toward increasing stockiness from the first to the 
eleventh month, at which time the curves change direction. Two judges (B and W) 
studied the entire series of photographs, independently, in an attempt to estimate 
the age at which these subjects ceased to increase in stockiness and started to 
grow more slender. B made self-comparisons of the photographs of 62 cases and 
estimated the peak in the trend toward increasing stockiness for each child. The 
mean of these was 12*8 months. W similarly studied the photographs of 50 cases 
and arrived at a corresponding estimate of 11*6 months. The close agreement 
between the evidence afforded by the W./L.® curves and these independent estimates 
from photographs, with respect to the time of occurrence of this change in trend, 
lends further support to claims for the validity of the W./L.^ index as an indicator 
of body-build in infants and young children. 

Before leaving this matter of the curve of development of body-build, however, 
we should consider some other indices, and the claims made for them. One index 
proptised by Davenport is that of chest girth, divided by length, or relative chest 
girth (14). This index presents a picture of the development of body-build different 
from the W./T...^ curve. Davenport concludes (on the strength of the data presented 
graphically in his Pig. 2, p. 13) that “at six weeks the body has reached its maximum 
postnatal chubbiness. Another temporary loss in chubbiness occurs at about eight 
months, due perhaps to the cutting of the incisors.” Oiir W./L.* curve, however, puts 
the time of maximum chubbiness at eleven months instead of six weeks, and there 
is no such sudden falling off in chubbiness at eight months as portrayed in Daven- 
port’s curve. We incline to the belief that the irregularities in Davenport’s curve, 
including the sharp drop at eight months, are extraneous, and attributable primarily 
to racial heterogeneity in the cross-sectional samples, to the fewness of the cases 
in some of his samples (the number of cases in an age group ranges from 9 at 
month 12 to 37 at month 7), and to the fact that the measurements were taken 
by different examiners. This is increasingly evident when his curve is compared 
with our curve of relative chest girth (Fig. 24). Our curve shows an index decreasing 
with age from the 1st month (especially after the 9th month) which is far more 
regular than Davenport’s in its trend, showing no radical change in plumpness at 
the 8th month. 

Even allowing for the fact of selection in our own sample, and for the relatively 



Davenport’s Curve of Belative Chest Girth compared with the Growth Study Curve. 
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small number of cases, it still seems probable that the true curve of development 
in body^build is more nearly approached with the data from the present sample, 
and using the W./L.® index, than with Davenport's data, simply because the data 
are seriatim rather than cross-sectional, the measurements were taken by one 
person, and a more reliable and valid index of build is used. That our curve is 
typical is evidenced by the fact that the age curves of the W./L.® indices given in 
other studies, e.g. Quetelet(84) and Davenport (16), are all very similar, in this early 
portion of the curve, to our own. 

Further comparisons of the indices are afforded by their inter-correlations. 
These inter-correlations, arranged in order of size, are presented in Table XIX. 

TABLE XIX. 


[nter-Gorrelation of hndices of Body-build, 





Months 



Mean 


1 

6 

9 

18 

36 

H.W./L. X H.W./S.L. 

•861 

•952 

•938 

•824 


•894 

O.C./S.L. xC.C./L. 

•729 

•877 

•826 

•776 

— 

•802 

H.W./L. X W./X..2 

•804 

•753 

•706 

•8CK) 

•689 

•760 

H.W./S.L. X W./L.-^ 

•648 

•653 

•553 

•664 

— 

•627 

C.C./L. X W./L.2 

•454 

•602 

•624 

•721 

-570 

•594 

H.W./L. X C.C./L. 

•397 

•601 

•549 

•706 

•416 

•634 

H.W./S.L. X O.C./S.L. 

•327 

•542 

-623 

•607 

— 

•626 

C.C./S.L. xW./L.* 

•218 

•603 

•455 

•624 

— 

•460 

H.W./S.L. X C.C./L. 

•204 

•538 

•482 

•462 

— 

•422 

H.W./L. X C.C./S.L. 

*167 

•468 

•503 

•498 

— 

•407 

W./L.a X S.L./L.* 

•314 

•444 

— 

•272 

•141 

•272 


* Additional oorrelationa between W./L.* and S.li./Li. at 12 and 24 months are + *826 and + *186, 
respectively. 


In general, the indices which correlate most highly with each other are those in 
which there is high correlation (or identity) between the measures included in the 
two indices. There is also some evidence that the higher correlations are between 
indices composed of the more reliable inoasures. As between highly inter-correlated 
indices, those composed of the most reliable me4i.sures are of course more desirable 
as measures of build. The two highest r's are between measures in which the 
numerators are identical, and the denominators are length and stem length. Since 
the former is a more reliable measure, other things being equal, it should be 
considered the better index. The next three correlations, in order, are comparisons 
between width -length relations and W./L.*. Since the latter is made up of far 
more reliable measures than the others, it is obviously a more reliable index, and 
probably, also, a more complete measure of the width-length relationship than are 
any of these other indices with which it is compared*. 

[* The authors do not seem to have considered the influenoe of ‘‘spnrious** correlation on the 
correlation of these indices. £d.] 
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Of the other comparisons, we must consider the fact that where there is little 
correlation between two indices which are composed of reliable measures, either 
may be a valuable measure of differing body-build traits. In this connection, the 
correlations between W./L.* and S.L./L., which average only ’27, are worthy of 
consideration. These two indices are composed of the three most reliable body 
measures we have taken (not including head measures), yet they show practically 
no relationship to each other. The former correlates closely with the judges’ 
estimates of chubbiness, while the latter shows no relation to these estimates 
(Table XVIII, p. 60). It is in no way a measure of lateral versus linear tendency, 
but of independent linear proportions. It is possible that in considering the relation- 
ship of body-build to mental characteristics, both of these indices should be used. 
The Wertheiriier-Hesketh index, which takes account of both relationships, may 
correlate less with our empirical criterion in part because of this fact. Yet for this 
very reason, it may be more discriminating (where the individual measurements 
are reliable) in picking out the extreme builds in which short legs and long thick 
bodies (pyknic) go together and those in which long legs and short thin bodies 
(asthenic) coincide. The nature of the correlations between the two separate 
indices is evidence that for these early years at least, there is no tendency for such 
body-builds to occur much more frequently than would be expected from a chance 
distribution in which these are the extreme deviates. 

So far in our discussion, we have treated body-build as relative chubbiness 
or width-length relationship including subcutaneous tissue as well as skeletal 
structure. If we wish to limit our definition of build to the skeletal structure, we 
cannot with any certainty draw conclusions from our measures of external body 
dimensions, since these must always be affected to some extent by subcutaneous 
tissue. The width measure which is most reliable, hip width, is probably, because 
it is a simple linear measure, freer from the influence of subcutaneous tissue than 
is weight, and so may be a better index of skeletal proportions. The H.W./L. index 
(Fig. 15, p. 58) shows a change in proportions with age which does not coincide 
with our criterion of ratings of build. According to this index, there is a rapidly 
increasing lateral tendency through the sixth month, followed by an almost equally 
rapid decrease after the seventh month. But since the judges* ratings were made 
from photographs, they were necessarily aflected by subcutaneous tissue, and need 
not be an adequate criterion of skeletal proportions alone. The apparent increasing 
chubbiness between the seventh and eleventh months might be due entirely to 
general growth in subcutaneous tissue. Since we have seen in Figs. 9 and 10 
that hip width, chest width and shoulder width all continue to grow rapidly 
throughout the first year, we should expect increasing width relative to length 
for a longer period than six months. The absolutely greater increase of length, 
which is a much larger diameter, would mask a tendency to continued increase in 
chubbiness when the comparison is made with only one of the width diameters, 
and for this reason it is doubtful if the H.W./L. index gives us a true picture 
of changing build, either of skeletal proportions, or of proportions which include 
other tissues in addition to bony structure. 
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G. OONSISTBNOY OF BCTILD. 

Of the indices of body-build which we have computed, W./L.* has been found 
by an empirical criterion, which is based on subcutaneous ti^ue as well as skeletal 
proportions, to be the most valid measure of growth changes in chubbiness or 
slenderness of build. We have also found that it shows a high degree of con- 
sistency in consecutive measurements. How consistent is it over longer periods ? 
Granting average changes in body-build, for the total group, do the slender 
children remain relatively slender, and do the stocky infants continue to be the 
stocky three-year-olds ? 

(1) Individual Curves, 

Figs. 26 — 29 (p. 66) show growth curves in body-build (W./L.*) for selected 
subjects. Judge B, before consulting the body-build indices, named two cases 
whom she considered to be of definitely stocky, chubby build, two others whom she 
classified as definitely slender, and one other whose body-build, in her estimation, 
approximated the average of the group. The close correspondence between the 
group's average and the individual body-build curve of case 29 F (the one selected 
as “ typical” by judge B) is clearly indicated in the figure. Case 9 M, one of the two 
subjects estimated by i5 as having a definitely stocky build, is seen to be far above 
the group average for the first ten months of postnatal life. He reached his peak, 
however, in the curve of developing stockiness, at the age of five months, as against 
the group average of eleven months. But although his curve from the tenth month 
is much closer to the group's average curve of build than was the case in the early 
months, case 9M throughout is distinctly atypical in his tendency toward pro- 
nounced stockiness. Case 53 M, the other subject selected as definitely above the 
average in stockiness by judge B, presents a very different developmental picture. 
For the first five months this infant was fairly typical in his weight to height 
proportions, but between his fifth and seventh month he gained far more rapidly 
in weight than in length, and more rapidly than did the group as a whole, with 
the result that his W./L.* index changed strikingly, as indicated in the figure. 
The relative r61e of endogenous and exogenous factors in such sudden and unusual 
changes as this one is a matter which needs investigation. Since the group is 
a relatively homogeneous one, with the best of dietary advice equally at the 
command of all of the subjects' parents, the inference would seem to be indicated 
that such sudden shifts in metabolism are due largely to endogenous factors. When 
the shift is in the direction of a sudden sharp decrease in weight, rather than an 
increase, a history of recent illness would of course be looked for before attempting 
to explain the decrease as due to the influence of postnatal maturational factors. 
It is of interest to note that case 53 M's curve remains well above the average 
from the seventh month on, pointing to the establishment of a somewhat stable 
metabolic equilibrium in his organism at this supernormal level. 

Case 7 F presents the picture of a subject who, during the first three years of 
postnatal life, is consistently more slender than the average. From the third month 
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Figs. 25 — 29. Individual Curves of Cxrowth in the Weight : Ijength*** Index compared with Curves of the 
Moan and Plus and Minus one Standard Deviation. 

Fig. 30. Weight : liength^ Curves of two Cases who differ widely in their Tendency toward Constant 
Weight : Length Proportions. 


to the twelfth she is strikingly atypical in her tendency toward extreme slenderness. 
During the ten-months’ interval between her eight- and eighteen -month measures 
she gained in weight at a relatively faster rate than in height, even though after 
month eleven the group as a whole was showing the contrary tendency, and this 
deviation from the typical developmental trend results in this subject’s close 
approach to the normal at the eighteenth month. During the next six months, 
however, her curve changes direction once more and now falls away more sharply 
than does the average of the ^oiip, so that by the twenty-fourth month she is 
again well below normal. Case 18 M is an individual who, except for the third 
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month, is also consistently more slender than the normal infant. In contrast to 
7 F, however, this atypically slender baby reached his peak in the curve of in- 
creasing stockiness at the ninth month, and from that point on deviates more and 
more sharply from the mean. 

(2) Correlations. 

These individual curves indicate the results which we may expect when the 
indices are correlated over longer intervals to determine their consistency. The 
effect of sudden pronounced shifts — like those shown by case 53 M between the fifth 
and seventh months upon the rank orders of the subjects and consequently upon 
the correlation coeflBcients for the intervals of time during the course of which these 
shifts occurred — is obvious. When individual organisms are going through such rapid 
and pronounced readjustments, the correlation coetBcieots for the body characters 
involved in such readjustments will drop. A study of the consistency correlations, 
therefore, should be informative with respect to 'which developmental periods tend 
to be characterised by these pronounced individual divergences from the mean curve 
of growth, and which ones manifest relatively stable developmental characteristics 
on the part of the individual members of the group. Table XX gives these data. 

Self-Correlations showing Consistency of Body-Build. 


TABLE XX. 
W./L.* 


Month 

3 

6 

12 

18 

24 

30 

36 

1 

•68 

•48 



•17 


•10 

3 

— 

•64 

— 

•17 

•31 

— 

•28 

6 

— 

— 

•44 

— 

•24 

— 

•28 

12 

— 

— 

— 

•79 

•74 

— 

•76 

18 

— 

— 

— 

— 

•73 

— 

•80 

24 

— 

— 

— 

— 

— 

•74 

•73 

30 

— 

— 

— 

— 

— 

— 

•91 


It is evident at a glance that something is operating to reduce the longer-interval 
correlations into which the first month measurements enter. The W./L.* correlations, 
while they are *718 for the first and second months, and *741 for the second and 
third months (see Table I), become *68 for the first and third months (Table XX) 
and drop to *48 and to T7 for the first and sixth months and the first and twenty- 
fourth months, respectively. Similarly, the correlations of the sixth month indices 
with those for the twelfth, twenty-fourth and thirty-sixth months give the following 
coefficients, *44, *24 and '28. But when the twelve-month indices are correlated 
with those for the eighteenth, twenty-fourth and thirty-sixth months we find a 
considerably higher degree of stability in the rank orders of the subjects at these 
ages, with coefficients of '79, '74 and '75. Over the eighteen-month interval between 
the ages of 18 and 36 months, our subjects maintain their relative positions with 

6—2 
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respect to their W./L * index to the extent indicated by a coefficient of correlation 
of the order of *80, while for the thirtieth and thirty-sixth months the correlation 
is *91. 

The consistency correlations for the H.W./L. index (Table XXI) are very similar 
to those for W./L.*. The r*8 of the former are more often larger when the first and 
sixth months are correlated with later ages, while the latter are higher when the 
twelfth and twenty-fourth months are compared with later ages. None of the 
differences, however, is great enough to be significant. And, in general, the H.W./L. 
index shows no greater consistency in build than does the W./L.* index. 

These correlations, and also the individual curves, show that there is little 
consistency in lateral-linear build during the first year — just as there is little 
consistency in the relative size of weight and all width rneaaures. But after the 
twelfth month the correlations increase somewhat and the children appear to have 
established a body-build which is comparatively stable — at least over a two-year 
period. 

^elf ■‘Correlations shmving Consistency of Body- Build, 


TABLE XXI. TABLE XXI 1. 

H.W./L. S.L/L. 


Month 

6 

12 

24 

36 


Month 

5 

12 

18 

24 

36 

I 

•50 

•24 

•27 

•13 


1 

+ 12 

-06 


-11 

-07 

6 

— 

•34 

•47 

•46 


5 

— 

+ •30 

— 

+ •26 

+ 16 

12 

— 

— 

•70 

•66 


12 


— 

+ •28 

+ •37 

+ 13 

24 

— 

— 

— 

•72 


18 

— 

— 

— 

— 

+ •62 







24 

— 

— 

— 

■ — 

+ •42 


Consistency correlations were computed for another index composed of reliable 
measures, S.L./L. (Table XXII). All the coefficients arc low for all ages, even 
though the measures used in this index are relatively consistent. It is probable 
that since the legs are continuing to grow rapidly through the thirty -sixth month, 
individual differences in growth trends would obliterate any tendency for consistent 
stern-stature relationships within the age limits hero considered^. 

Body -build types may be manifested during the first year in the growth changes 
of the W./L* index rather than in the child’s relative build at any given age. 
As we have seen (Figs. 25 — 29, p. 66), when the age curves of this index are plotted 
for each child separately, we find wide individual difference in the trends of the 
curves. Some of the children (for example, cases 9 M and 18 M) show an early 

* Hejinian and Hatt(25) plotted individual curves of the percentile rank of S.Ij./Ij. index for 
50 children aged 24 to 60 months. Seventy-two per cent, of these showed oonaistenoy of build, but the 
others distinctly changed their percentile rank. This body proportion, then, remains unstable for many 
children even to five years of age. 




Plate 1(a). i month 


Biometrika, Vol. XXVII, Parts I and II 

Iin)'U>y iind Davis : Ch oivth Chaft^i^rs in Infants 


Plate I 



Plate I (C l. 30 months 


Biometrika, Vol. XXVII, Parts I and II 

Baylcy and Davis: ih'otvth C/ianifc’s in Infant': 


Plate I {bis) 



9 W was extremely chubby at 5 months. 





Plate II (c). 36 months 


Biometrika, Vol. XXVII, Parts I and II 

Hayley and l:)avis : Growth C?iangcs in Infants 


Plate II (his) 




Comparisons between a consistently small child (left) and a consistently large child (right) at three age levels: 

1, 9 and 36 months. 
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increase in chubbiness^ md others a late increase (case 7 F), as compared with the 
majority, who follow the curve of the mean. A number of the more slender children 
have little or no tendency toward increasing W./L.® indices during the first year, 
but maintain a comparatively constant index throughout the entire three years. 
The W./L.® curves of two girls, cases 20 F and 65 F (Fig. 30), illustrate the wide 
divergence from the mean in tendency to change in body-build. Case 20 has an 
index which is close to the mean for the first four months. After this time it increases 
rapidly to the eighth month, stays high through the fifteenth month, and then 
decreases at a rate far more rapid than the average, until at the thirtieth and 
thirty-sixth months her index has returned very near to the mean. Case 56, on 
the other hand, has indices which are seldom far from the mean, though they are 
well above it during the first six months. Her index curve is almost a straight line, 
and, except for the rise between the first and second months, and a slow decline 
after the fifteenth month, shows no tendency towards a changing W./L.® index. 

When the group is divided into halves, one half containing children with less 
than average tendency to increased W./Ij.® index during the first year, and the 
other half containing those with greater tendencies to chubbiness, the slender group, 
as would be expected, has a much higher consistency correlation between the sixth 
and twelfth months (r is *87 for 26 cases) than is found for the total group (*44). 
However, their consistency correlations drop later, and are lower than for the total 
in the r between measures at the twelfth and twenty-fourth months (*53) and 
between the twenty- fourth and thirty-sixth (*57). Of the entire group of children, 
only one (46 M, Plate I, see p. 32) was found to be so consistently heavy that his 
W./L.® index did not once fall within one standard deviation of the mean for the 
17 ages at which the children were measured. One other child (9M, Plate II, see 
p. 32) remained above one standard deviation for all but three ages. Of the 
slender children, one (67 F, Plato I) remained below one standard deviation for all 
but two ages, while another (27 F, Plate II) remained below for all but three ages. 

Infant trends in body build may be characteristic of glandular and metabolic 
balances in the individual and so predictive of build later in life. This we cannot 
know until we can compare these early trends with measures of the same individuals 
later. But we have found that no single index (composed of the measures we have 
taken) of an infant under one year can be considered as representative of a constitu- 
tional type which will continue to be one of its characteristics for any protracted 
length of time. 


SUMMARY AND CONCLUSIONS. 

1. A seriatim growth study of nine measures of body size was made on a group 
of from forty-six to sixty-one infimts who were measured at 17 agea during their 
first 36 months. 

2. The measures were found to be sufficiently reliable to indicate growth trends 
of individuals; there are indications that some measures are more reliable than 
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others, the highest retneasurement-coefficieats being found for length, weight and 
stem length. 

3. The group wa43 found to be typical of other groups reported in the literature, 
except that after the sixth month its members averaged taller and heavier than 
children in other parts of the country, with the exception of selected groups such 
as those mentioned in the recent Iowa studies. 

4. Growth in all dimensions was retarded, rapidly at first, and then more slowly 
through 36 months. 

5. In proportion to their size at the fii*st month, the width measures of hip, 
chest and shoulder increased more rapidly during the first year than any other 
measure except weight. After this age, the greatest proportionate growth was in 
length. Chest depth increased even less than did head circumference. 

(». In variability there were no significant changes with growth, nor between 
the sexes. 

7. The boys were consistently a little larger for all measures than the girls. 

8. Correlations over the three-year period show that head circumference, length 
and stem length are the most consistent measures ; that is, children with large heads 
at the first month tend to have relatively large heads at the .thirty-sixth month 
and those with small heads usually continue to have small heads, while the same is 
true of lengths. Weight and width measures show only slight consistency before 
the twelfth month, but with the exception of two unreliable measures (chest width 
and depth) are fairly consistent after this age. 

9. Hip width and weight are more closely correlated than any other pair of 
measures. Length and stem length are also highly correlated. Other measures of 
body width correlate fairly highly with weight; while the width-length comparisons 
show less relationship though they are still positively correlated. The smallest 
correlations are found between shoulder width and stem length and average *53. 

10. There is a decrease in some of the correlations between measures of widths 
and lengths during the first half of the first year. This is true of the r*s between 
weight and length, hip width and length, hip width and stem length, and less 
clearly between hip width and shoulder width. These decreased r*s appear to occur 
more often at ages when the measures involved are growing at disproportionate 
rates. Thus, the width-length correlations drop during the ages when most children 
are rapidly becoming chubbier. 

11. Growth changes in body proportions are shown by curves of indices, 
comparing length measures with width measures and with weight. Most of these 
show proportionately increasing width during some portion of the first year. 

12. A comparison of these indices with ratings of body build leads us to the 
conclusion that W./L.* is (of the indices studied) the most valid measure of relative 
chubbiness, or lateral-linear tendencies in build. 



Nancy Bayley and Prank O. Davis 71 

13. The W./L.* index shows a curve of rapidly increasing chubbiness during the 
first eleven months, with a slow decrease in chubbiness after this age. 

14. Of the three indices of build whose values were correlated for consistency 
over a long age interval, namely, W./L.*, H.W./L. and S.L./L., none indicated con- 
sistency of build during the first year, though W./L.* and H.W.yL. show increasing 
consistency after this age. 

16. We cannot predict from indices at an early age what an individuars body- 
build will be later. It may be possible, however, to predict later build from early 
trends in build. Our data give no clear evidence on this point. 

16. No significant sex differences in rates of growth were found for this group, 
though the boys were found to be slightly but consistently heavier for their length 
than the girls. 

No^'E. Since this memoir was set up a letter has reached the Ekiitor from 
Dr Bayley, which he thinks somewhat explains their different conceptions of 
“chubbiness.'’ No relation of width to length would conform to his conception of 
chubbiness = plumpness. Nor did he after a study of Kretschmer's work based on 
no extensive statistical investigation give credit to his classifications or conclusions. 
The following are the important paragraphs from Dr Bayley's letter: 

“I have been trying out some of the indices you suggested. They should prove 
of value in studies of nutrition. However, in the study under consideration, I was 
most interested in finding an index which would differentiate types of body-build 
such as those described by Kretschmer (pyknic-leptosome), or Davenport (lateral- 
linear), and not so much interested in obtaining a measure of nutrition or health. 
This latter problem is one I intend to take up in connection with a study of 
physicians' ratings of the children's health. 

“It would, of course, be of value to find an index which differentiates the 
children and is constant at all ages, and in which the individual children maintain 
the same relative position in the group. None of the indices for which I have made 
the comparisons has remained consistent, so that no child who is classified in 
body-build at one age can be expected to have the same build at another age. 
“Chubbiness" as we used it in our ratings of photographs meant the tendency to 
be wide, relative to length, rather than to have excess fatty tissue, though this 
latter factor had a strong influence in the rating. 

“It still seems to me that the index I am looking for is one which expresses 
width relative to length, and all of my data indicate that such an index would show 
changes with age, and would not be constant. This finding in itself should be of 
importance in understanding the processes of growth in young children." 
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TABLE XXIII. 

ANTHROPOMETRIC^ MEASURES—INFANT GROWTH STUDY. 


((/.) Cheat Circumference Frequency Distributions. 


Month 1 


Month 2 

Cm. 

F 

430 

1 

42-5 

— 

42-0 

2 

41-5 

2 

41 0 

— 

40*5 

1 

40-0 

3 

39C 

5 

390 

8 

38-6 

4 

380 

7 

37*6 

3 

370 

3 

36-5 

2 

3(5*0 

4 

35*5 

4 

35*0 

1 


Month 3 



Month 4 

Month 5 

Mor 

Cm. F 

Cm. F 

Cm. 

46*5 1 

47*5 2 

49*6 

46*0 — 

47*0 2 

49*0 

45*6 1 

46*5 — 

48*5 

450 1 

46*0 1 

48*0 

44*5 

46*5 2 

47*6 

44*0 1 

46*0 2 

47*0 

43*5 2 

44*5 5 

46*5 

43*0 5 

44*0 3 

46*0 

42*5 4 

43*5 6 

46*5 

42*0 6 

43*0 6 

45*0 

41*5 4 

42*5 3 

44*5 

41*0 4 

42*0 6 

440 

40*5 5 

41*6 4 

43*5 

40*0 6 

41*0 3 

43*0 

39*5 3 

40*6 3 

42*6 

39*0 4 

40*0 2 

42*0 

38*5 1 

39*5 1 

4J*6 

38*0 2 

39*0 — 

41*0 

37*5 — 

38*6 1 

40*5 

37*0 — 


40*0 



39*5 


Month 7 


Month 8 


Month 9 


Month 10 


Month 11 


Month 12 
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TABLE XXIII — (continued). 


Month 7 

Month 8 

Month 9 

Month 10 

Month 11 

Month 12 

Cm. 

F 

Cm. 

F 

Cm. 

F 

Cm. 

F 

Cm. 

F 

Cm. 

F 

180 

1 

18-0 

1 

18-3 

1 

17 9 

1 

17-8 

2 

181 

2 

17*5 

1 

17*7 

— 

180 


17-6 

— 

17*5 

1 

17-8 

1 

170 

— 

17-4 

1 

17-7 


17-3 

2 

17-2 

1 

17*6 

1 

16-5 

3 

171 

3 

17-4 

— 

170 

4 

16-9 

2 

17*2 

2 

160 

2 

16-8 

2 

17*1 

1 

16-7 

1 

16-6 

1 

16-9 

— 

15-5 

9 

16-5 

1 

16*8 

4 

16-4 

4 

16-3 

6 

16-6 

6 

160 

7 

16*2 

7 

16-5 

2 

16- 1 

2 

160 

4 

16-3 

4 

14-6 

12 

J6-9 

1 

16-2 

2 

16*8 

4 

16-7 

3 

160 

3 

140 

5 

16*6 

3 

16-9 

3 

16-5 

6 

16-4 

2 

16-7 

6 

13*6 

7 

15*3 

6 

15-6 

9 

16-2 

5 

151 

2 

16-4 

3 

13*0 

2 

160 

5 

16-3 

2 

14-9 

6 

14-8 

4 

16- 1 

1 

12-6 

2 

14-7 

7 

160 

7 

14-6 

4 

14-5 

2 

14-8 

1 



14-4 

4 

14-7 

3 

14*3 

— 

14-2 

2 

14-5 

2 



141 

— 

14-4 

6 

140 

4 

13-9 

3 

14*2 

2 



13-8 

6 

14- 1 

4 

13-7 

2 

13-6 

1 

13-9 

— 





13-8 

2 

13-4 

4 



13-6 

2 





13-5 

1 











13-2 

1 







1 




12-9 

— 








Month 16 

Month 18 

Month 24 

Month 30 

Month 36 

Cm. 

F 

Cm. 

F 

Cm. 

F 

Cm. 

F 

Cm. 

F 

18*3 

2 

18*9 


19-7 

1 

19-6 


19-9 

1 

18-0 

— 

18-6 

3 

19-4 

— 

19-3 

1 

19-6 

4 

17-7 

1 

18*3 

— 

191 

— 

190 

1 

19-3 

1 

17-4 

3 

180 

3 

18-8 

— 

18*7 

2 

19-0 

2 

171 

2 

17-7 

2 

18-6 

2 

18-4 

2 

18-7 

4 

16-8 

1 

17*4 

2 

18-2 

4 

181 

6 

18-4 

4 

16-6 

6 

17*1 

3 

17-9 

1 

17-8 

6 

18-1 

13 

16*2 

3 

16-8 

2 

17-6 

6 

17-6 

6 

17-8 

8 

16-9 

2 

16-6 

6 

17-3 

7 

17-2 

5 

17-6 

1 

16-6 

4 

16-2 

4 

17-0 

4 

16-9 

4 

17-2 

2 

15-3 

4 

16-9 

6 

16-7 

3 

16-6 

2 

16-9 

4 

160 

5 

16-6 

1 

16-4 

5 

16-3 

1 

16-6 

2 

14*7 

2 

16*3 

3 

16-1 

1 

16-0 

1 

16-3 

1 

14-4 

1 

160 

1 

16-8 

— 

16-7 

— 

160 

— 

141 

— 

14*7 

— 

16-6 

— 

15-4 

— 

15-7 

1 

13-8 

— - 

14*4 

— 

15-2 

— 

161 

— 

16-4 

1 

13-6 

— 

14-1 

— 

14-9 

3 

14-8 

1 

1 
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TABLE yiXm—ic<mUnued). 

(e) Head Circumference Frequency Distributions. 


Month 1 

Month 2 

Month 3 

Month 4 

Month 5 

Month 6 

Cm. 

F 

Cm. 

F 


F 

Cm. 

F 

Cm. 

F 

Cm. 

F 

41 0 

1 

430 

1 

44*5 

1 

460 

1 

460 

1 

47*0 

1 

40-5 

1 

42*5 

— 

44*0 

— 

44*5 

— 

45*5 

1 

46*5 

— 

400 

1 

420 

__ 

43*5 

— 

44*0 

4 

45*0 

3 

46*0 

5 

39*5 

2 

41-5 

1 

43*0 

2 

43 5 

3 

44*5 

2 

46*5 

3 

39-0 

4 

4 10 

7 

42*6 

4 

43*0 

4 

44*0 

11 

45*0 

11 

38*r> 

5 

40*5 

4 

42*0 

3 

42-6 

10 

43*5 

9 

44*5 

6 

38-0 

9 

400 

9 

41-5 

8 

420 

12 

43*0 

8 

44*0 

6 

37/> 

7 

39*5 

10 

41*0 

12 

41*5 

7 

42*5 

6 

43*5 

8 

370 1 

9 

390 

10 

40*5 

10 

41*0 

8 

42*0 

7 

43*0 

6 

36-5 

4 

38*5 

5 

400 

7 

40*5 

4 

41*5 

! 3 

42*5 

4 

360 

1 

380 

6 

39*5 

3 

40*0 

2 

41*0 

i 4 

42*0 

3 

35-5 

4 

37*5 

2 

390 

4 

39*5 

1 

40*5 

— 

41*6 

2 

350 


370 

2 

38*5 

2 

39*0 

1 

40*0 

i 1 

41*0 

— 

34-5 


36*5 

1 

380 

1 

38*5 

— 

39*5 

1 — 

40*5 


34*0 

— 

360 

— 

37*5 

— 

38*0 

— 

39*0 

I 1 

40*0 

1 

33-6 

1 

1 

35*5 

1 

1 

370 

1 

37*5 

1 


1 

i 




Month 7 

Month 8 

Month 9 

Month 10 

Month 11 

Month 12 

(^m. 

F 

Cm. 

F 

Cm. 

F 

Cm, 

F 

Cm. 

F 

Cm. 

F 

48*6 

1 

48*5 

1 

49*5 

1 

500 

1 

50*0 

2 

61 0 

1 

48*0 



48*0 

1 

49*0 

— 

49*5 

1 

49-5 

— 

50*6 

1 

47*5 

— 

47*5 

3 

48*5 

3 

49*0 

1 

49*0 

3 

500 

1 

47*0 

3 

47*0 

5 

48*0 

— 

48-6 

2 

48*5 

6 

49*5 

2 

46*5 

3 

46*5 

5 

47*5 

7 

48*0 

6 

48*0 

6 

49*0 

3 

46*0 

7 

46*0 

4 

47*0 

4 

47*5 

7 

47*5 

6 

48*5 

7 

45*5 

8 

45*5 

10 

46*5 

7 

47*0 

5 

47*0 

3 

48*0 

5 

45*0 

7 

45*0 

7 

46*0 

5 

46*5 

4 

46*5 

8 

47*5 

6 

44*5 

3 

44*5 

7 

45*5 

9 

46*0 

6 

46*0 

2 

47*0 

5 

44*0 

8 

44*(» 

1 

45*0 

3 

45*5 

5 

45*5 

O 

46*5 

3 

43*5 

2 

43*5 

1 

44*5 

6 

45*0 

3 

45*0 

3 

46*0 

6 

43-0 

1 

43*0 

3 

44*0 

— 

44*5 

4 

44*5 

2 

45*5 

— 

42*5 

5 

42*5 

— 

43*5 

3 

44*0 

1 

44*0 

0 

45*0 

1 

42*0 

1 

42*0 

— 

43*0 

— 

43*5 

— 

43*5 

0 

43*5 

— 

41*5 

— 

41*5 

1 

42*5 

— 

43*0 

— 

43*0 

I 1 

43*0 

— 

41*0 

1 



42*0 

— 

42*5 

— 



42*5 

1 





41*5 

1 

1 

42*0 

1 
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Month 15 


Month 18 


Month 24 


Month 30 


Month 36 


(y*) Length Frequency Distributions, 


1 


Month 2 Month 3 Month 4 Month 5 


Month 6 


70-5 

1 

72-6 

1 

700 

I 

720 

1 

69-5 

1 

71-5 

1 

690 

1 

71*0 

2 

68*5 

4 

70*6 

3 

68*0 

2 

70*0 

2 

67-5 

2 

69*6 

3 

670 

7 

69*0 

2 

66*5 

2 

68*5 

3 

660 

3 

68*0 

4 

65 5 

6 

67*5 

7 

650 

4 

67*0 

7 

64-6 

5 

66-6 

2 

64*0 

3 

66*0 

4 

63*5 

6 

66*6 

3 

63*0 

4 

66*0 

4 

62-5 

1 

64*5 

1 

62*0 

1 

64*0 

4 

61*5 

— 

63*6 

— 

61*0 

1 

03*0 

1 

60-6 

— 

62*6 

1 

600 

1 

62*0 

— 

69-5 

— 

61*6 

— 

690 

— 



58*5 

1 
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TABLE XXIII — {ooKtinued). (g} Weight Frequency Distributiona, 


Month 1 


Month 2 


Month 3 


Lb. Oz. 


Month 4 


Lb. Oz. 


Month 5 


Month 6 


Lb. Oz. Lb. Oz. 
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TABLE X.XU1— (continued). 


Month 16 

Mouth 18 

Month 24 

Month 30 

Month 36 

Lb. 

Oz. 

F 

Lb. 

Oz. 

F 

Lb. 

Oz. 

F 

Lb. 

Oz. 

F 

Lb. 

Oz. 

F 

31 

8 

1 

32 

9 

2 

37 

0 

1 

41 

0 

1 

44 

0 

1 

31 

0 



31 

9 

2 

36 

8 

— 

40 

0 

— 

43 

0 

— 

30 

8 

1 

30 

9 

3 

36 

0 

— 

39 

0 

— 

42 

0 

— 

30 

0 

— 

29 

9 

2 

35 

8 

— 

38 

0 

1 

41 

0 

2 

29 

8 

— 

28 

9 

2 

35 

0 

— 

37 

0 

2 

40 

0 

— 

29 

0 

2 

27 

9 

3 

34 

8 

3 

36 

0 

2 

39 

0 

2 

28 

8 

1 

26 

9 

10 

34 

0 

— 

35 

0 

3 

38 

0 

1 

28 

0 

— 

26 

9 

8 

33 

8 

— 

34 

0 

1 

37 

0 

3 

27 

8 

6 

24 

9 

2 

33 

0 

2 

33 

0 

2 

36 

0 

1 

27 

0 

3 

23 

9 

6 

32 

8 

1 

32 

0 

6 

36 

0 

4 

26 

8 

2 

22 

9 

6 

32 

0 

3 

31 

0 

7 

34 

0 

7 

26 

6 

1 

21 

9 

1 

31 

8 

— 

30 

0 

6 

33 

0 

6 

26 

8 

3 

20 

9 


31 

0 

1 

29 

0 

2 

32 

0 

4 

26 

0 

4 

19 

9 

— 

30 

8 

2 

28 

0 

6 

31 

0 

6 

24 

8 

4 

18 

9 

J 

30 

0 

— 

27 

0 

4 

30 

0 

5 

24 

0 

6 




29 

8 

2 

26 

0 

1 

29 

0 

5 

23 

8 

1 




29 

0 

4 

25 

0 

— 

28 

0 

1 

23 

0 

3 




28 

8 

6 

24 

0 

1 

27 

0 

— 

22 

8 

2 




28 

0 

3 

23 

0 

— 

26 

0 

1 

22 

0 

2 




27 

8 

2 

22 

0 

1 

25 

0 

— 

21 

8 

6 




27 

0 

1 




24 

0 

1 

21 

0 

2 




26 

8 

3 







20 

8 

1 




26 

0 

3 







20 

0 

1 




26 

8 

2 







19 

8 

1 




25 

0 

2 







19 

0 

— 




24 

8 

1 







18 

8 

— 




24 

0 

— 







18 

0 

— 




23 

8 

1 







17 

8 

— 




23 

0 

— 







17 

0 

— 




22 

8 

1 







16 

8 

1 
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Month 15 Month 18 Month 24 Month 30 Month 36 



{%) Shoulder' Width Frequency Disti'ibutions, 


Month 1 I Month 2 Month 3 Month 4 Month 5 Month 6 
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TABLE XXIII — (continued). 


Month 7 


!m. F 



Month 9 

Cm. 

F 

223 

1 

220 

2 

21*7 

2 

21*4 

2 

211 

6 

20-8 

— 

20-6 

6 

20-2 

2 

19*9 

4 

19-6 

3 

19*3 

7 

190 

2 

18-7 

5 

18-4 

5 

181 

— 

17-8 

— 

17-6 

— 

17-2 

— 

16*9 

1 

1 


Month 12 


Month 15 


Month 18 


Month 24 


Month 30 


Month 36 
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ENDOCRANIAL DIAMETERS AND INDICES. A NEW 
INSTRUMENT FOR MEASURING INTERNAL 
DIAMETERS OF THE SKULL. 

By K. WAGNER, TJniversiteteis Anatomiske Institutt, Oslo. 

1. Introduction. 

The close correlation between form of skull and of brain is so conspicuous a 
phenomenon that on superficial consideration it might seem unnecessary to make 
it the subject of a statistical study. When we bear in mind, however, the large bony 
protuberances that are characteristic for most of the prehistoric skulls, the correlation 
at once becomes less obvious. And everybody who has tried to make comparisons 
as to th<^ shape of the brain-case between prehistoric and more recent races of man, 
or even betwc^en recent but remotely related races, will undoubtedly have felt the 
need for a statistical study of the matter. Still greater will be the need therefore 
in an examination of the human race’s phylogenetic development, where the shape 
and size of the brain will always be the central point in the work of research and 
where also the anthropoids must be taken into consideration. The crania of these latter 
are, as we know, providtid with large muscle ridges. In many works on the craniology 
of the anthropoids it is just this considerable incongruency between the shape of 
the brain and the outer form of the cranium that has been the technical stumbling- 
block. In this connection it is sufficient to mention the well-known work of 
Stephanie Oppenheim ; Zur Typoloyie des PriTnatencraniums (20). It is sought to 
avoid the difficulties by selecting such points of measurement as will eliminate as 
far as possiVde the infliumce of the thickness of the cranial wall, but it cannot be 
said that in this way it has been found possible to secure a technique which is 
quite comparable with that employed in human craniology. 

Schwalbe in his famous work (28) on the calotte of Pithecanthropus erectus Dubois 
mentions that the breadth-length index of this fragment cannot be directly com- 
pared with the breadth -length index for recent skulls — likewise on account of the 
cranium’s “ Aussenwerke” (outworks), as Schwalbe (after Virchow) aptly designates 
the Torus supraorbitalis and similar formations. Thus already in this work the main 
stress is laid upon the measurements and indices found on casts of the cranial cavity 
partly on casts of the Pithecanthropus fragment, partly on casts of anthropoid crania 
examined for comparison In the following paper we shall deal with other more 
special problems connected with the greatly varying thickness of the cranial wall, 

* Note: Liater on, in 1902, Bchwalbe took up the same question for renewed treatment in his 
“Beziehungen zwischen Innenform und Aussenform des Schadels,” but unfortunately it is mainly the 
cranio-oerebral topography that is dealt with, i.e. a field that has only indirect interest for anthropology. 

Although the anthropology of the brain has in later years taken a prominent place in the work of 
re8earch-~cf. Arifins Kappers’ book on the development of the nervous system, in which the chapter on 
“Data for an Anthropology of the Brain” occupies one- third of the number of pages ( 12 ) only few 



BL WAaKKK 


80 


but what has alr^uJy been said is sufficient to show the need for an instrument that 
will render possible a direct determination of the three chief diameters of the cranial 
cavity, without sawing through the skull and without making casts. Over a year 
ago I therefore addressed myself to T. Gundersen, instrument^maker, Oslo, and 
submitted the task to him. At that time 1 was not aware that H. Weinert had in 
1925 already described an instrument aiming at the same purpose: “Ein neu^ 
Messzirkel zur Ermittelung von Innenmassen*’ (80). This instrument, which moreover 
is also mentioned in the latest edition of Martin’s Lehrhuch der Anthropologies), 
is later described in connection with a large work on the development of the Sinus 
frontalis (81). In the same work we are promised a more systematic comparison 
between the exterior and interior form of the skull — '‘eine eingehendere Arbeit 
(liegt) im Manuakript bereits vor” (S. 354), but, so far as I know, this work has not 
yet been published. 

The measurements with the new instrument were already far advanced when 
I became aware of Weinert’s work. Afterwards, in my study of the literature, I 
met with an article in Bionietrika dealing in part with the same problems, namely, 
Hoadley and Pearson’s ** On Measurement of the Internal Diameters of the Skull 
in relation I. To the Prediction of its Capacity, II. To the ‘Pre-eminence’ of the 
Left Hemisphere” (9). As appears from the title of that paper two main problems 
are there taken up for discussion. In the first part of the work the correlation of skull 
capacity with the diameters of the cranial cavity is compared with the correlation of 
skull capacity and external measurements. In other words, the authors investigate 
whether it is possible by internal measurements to arrive at a more reliable formula 
for calculation of capacity. In the second part of the work they take up the old 
question of the sagittal asymmetiy of the cranial cavity and consequently of the 
brain. The prevailing assumption that the left cerebral hemisphere, both in general 
and as regards certain functions {e.g. the speech centres), occupies a leading position 
as compared with the right hemisphere is, in Hoadley and Pearson’s opinion, not 
justified by reliable morphologico-biometric observations*. And their investigations 


attempts have hitherto been made to deal with the form of the brain in its entirety from an anthro- 
pologioal standpoint. In the literature we can find a number of biometric works on details of the 
different parts of the brain, but the questions which are of essential importance for anthropology are 
scarcely even formulated, not to speak of their being answered. And this in spite of the statement made 
by Elaatsch (14) more than twenty years ago: ‘*Nicht die individuellen Versohiedenheiten der einzelnen 
Furchen sind es, auf die es ankommt, sondem die Kombinationen von Winduugs- und Furchungs- 
aystemen, an den Hauptpartien, nioht minder aber auch auf die Gesamtform der grossen Absohnitte 
und des ganzen Grosshims.” l^raiseworthy attempts in this direction have recently been made by the 
American investigators Weil (89), Pickering (81) and Connolly (7), while Hrdlidka*s work on the fossae 
cranii, published as early as 1907 (10), must be mentioned also in this connection. 

[* This statement is liable to a misunderstanding on the part of the reader. Pearson and Hoadley 
found that the length of the right hemisphere in their material was greater than that of the left ; an 
examination of the literature showed that the more reliable investigators found the total right hemi- 
sphere to be heavier than the total left. The writers of the paper therefore concluded for the size and 
weight pxedomlnanoe of the right. But they stated finally that : 

We do not consider that this is in any way opposed to the view that — at anv rate for some functions — 
the left hemisphere may predominate, because we do not believe that volitimml predominance is of 
necessity associated with stich gross characters as size and weight, loc cit,, p. 117. £i>.] 



90 


Endocranial Diameters and Indices 


of the longitudinal development of the two halves of the cranial cavity point 
decisively in the opposite tlirection. In this investigation they deal with a very 
large and at the same time extremely homogeneous series (739 adult male skulls 
of the long 26th — 30th Dynasties Egyptian Series in the Biometric Laboratory), 
but, as will be explained more fully later on, my measurements have nevertheless 
led to the opposite result. 

Hoadley and Pearson make use of Weinert’s instrument and, in contrast to him, 
they give a very full description of the technical difficulties, which are by no means 
inconsiderable and which, moreover, we shall revert to later. The authors, as stated, 
investigated only male skulls belonging to one race. By extending the investigation 
to both sexes and to several races it would, of course, be possible to deal with a 
greater number of problems. Not only should we thereby be enabled for the first time 
to provide a systematically prepared body of comparative material for other investi- 
gators who wish to draw parallels regarding the main form of the brain in anthropoids 
and in prehistoric or n^cent racf\s of mankind, but we should also get an opportunity 
of discussing the question as to whether — and, if so, to what degree — the craniological 
racial differences which anthrojjology has demonstrated are accompanied by similar 
differences in the shape of the brain. Thus it is the racial differences that form 
the chief object of my investigation, and not, as in Hoadley and Pearson's work, 
the laws applying for the homo sapiens as a genus. In consequence of the constantly 
proceeding interaction betw<?en shape of skull and shape of brain during each 
individual's development it is impossible — at any rate for the present — to establish 
whether the point of attack for the fundamental factors of heredity is (a) on the 
brain itself, (6) on the bones of the skull, or (c) on both brain and skull. But it 
must be possible to ascertain the final result of this mutual interaction, provided 
we have sufficient material at our disposal. It is clear that the problem can here 
be dealt with only in broad outline, as we must limit the investigation to certain 
races. It is furthermore clear that when speaking of brain form and asymmetry of 
the brain in this paper, I mean so far as it is revealed through a study of the 
skull cavity. We have to disregard the influence of the meningeal membranes, 
but this point will certainly be of minor importance. In concluding from the 
transverse asymmetry of the skull cavity as to the probable transverse asymmetry 
of the brain, I have made one more assumption which will be mentioned in due 
place. 

It will be understood that my task is closely connected with the racial signifi- 
cance of the thickness of the cranial wally a matter that has received comparatively 
little attention in craniological literature. And as the variation in wall thickness 
undoubtedly leads to the inter-racial formulae for calculation of capacity giving 
relatively imperfect results, by extending the investigation to different races we 
also are led to examine that question. 

Among other problems of immediate interest which Hoadley and Pearson do 
not deal with may be mentioned one connected with the asymmetry of the skull. 
As is known, Elliot Smith as early as 1907 ( 26 ) pointed out that the left upper 
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occipital fossa is generally somewhat more strongly developed than the right. 
A difference in this respect was found in 80 % of a rather large series of Egyptian 
skulls and the author, as is reasonable, considers a greater development of the left 
posterior lobe of the cerebrum to be the fundamental cause of this. At the same 
time this phenomenon is deemed to bear relation to the usually occurring greater 
development of the sinus transversus on the right side; Elliot Smith assumes 
that there exists here a simple causal relationship: '*The fulness of the left 
occipital pole seems to be the reason for the dextral bending of the superior 
longitudinal sinus, the flattening of the right pole allowing more room for the 
bigger sinus on that side” (p. 577). I shall later on take up this question for further 
discussion. For the present I shall merely mention that M. L. Tildesley (27) from 
a statistical treatment of a rather large body of material has not been able to 
confirm Elliot Smith’s finding as regards this last-mentioned correlation. 

In what has been said above the plan of the present work is made clear. After 
having first described the instrument and mentioned the technical difficulties — 
which we hope to find considerably diminished in comparison with Weinert's 
procedure — we shall discuss the absolute measurements and the three main indices 
calculated therefrom, with special regard to their anthropological significance. 
In connection therewith I shall take up the question of the form of the brain in 
prehistoric man and in the anthropoids, where material for comparison is still 
unfortunately all too scanty. In Section V I shall deal with the asymmetry con- 
ditions of the cranial cavity, my task being here somewhat enlarged as compared 
with Hoadley and Pearson’s, and I shall then finally enter upon the question of 
capacity calculation, where the chief interest turns upon the possibility of esta- 
blishing an interracial formula which in accuracy can compare with the best 
intraracial formulae. 

I am fully aware that the inclusion of so many problems renders it difficult 
to deal thoroughly with each separate question and that the non-homogeneous 
nature of the material constitutes in certain respects a considerable weakness. 
Where the results must be regarded as only provisional, that fact will appear from 
the text, and by constantly taking into Eu^count the error of differences (statistically 
determined), I hope to avoid the dangers involved in the relative scantiness of the 
material. 


II. Instrument, Technique, Material, Error of Measurement. 

The instrument For the purpose of comparison it is necessary fii'st to give 
a very brief description of Weinert’s instrument. As shown in Fig. 67 in Weinert’s 
work, this consists of two slightly curved steel rods, the angle between which can 
be regulated by means of a hinge. Along these rods run two sliders, with the 
same curve as the rods themselves. After the instrument has been introduced into 
the cranial cavity the rods are fixed in suitable positions and the sliders are pushed 
forward until they touch the inside of the skull. It is noted how far the sliders 
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have been pushed forward, and when the instrument is then taken out it is brought 
back to the same position as before. The distance between the tips of the sliders 
is thereupon read off by aid of a millimetre-rule. As it is always a question of 
maxitnurn measurements, this procedure must be repeated a few times until it is 
c<?rtain that the highest value has been found. 

In the new instrument two flexible bands of steel are fixed to a metal bar 
about 12 min. wide, which is graduated in millimetres on each side. By aid of a 
small slide the steel bands can be moved outwards and the distance between their 
tips is found as the sum of th<^ figures read off on the scale behind the sliders. 
The “head” of the instrument is provided with a small roller, which ensures that 
the steel bands go straight outwards, and care is of course taken to preclude any 
lateral deviation. The bands are given a breadth of 10 mm., but their forward 
ends are rounded (^ff so that the surface of contact with the bone will be barely 
2 mm. wide. The dimensions of the head are made as small as possible, namely, 
12 X 24 mm., a size vvhi<*h renders possible an investigation of the smallest human 
ciania, nor will the majority of anthropoids be likely to offer any difficulties in 
this respect. Of great importance is the fact that the instrument is provided 
with a mirror, so that we can inspect the interior of the skull beforehand and 
during the measuring can see where the tip of the band touches upon the wall. 
It is of course necessary to light up the cranial cavity by means of a small electric 
lamp, which is introduce<i through the foramen magnum. As the skull during 
measurement lies with base upwards, this presents no difficulties. The skull must 
be well and firmly supported on a hag filled with mustard-seed or the like. It is 
sufficient to provide the instrument with one mirror, Jis the posterior point of 
measurement can always be viewed directly. I here leave out of consideration 
the me«'xsurement of internal breadth and height, since as regards these dimensions 
it is of minor importance to be able to see the points of contact (Fig. 1 (a), 

(^), (c)). 

T shall now make a comparison between Weinert’s instrument and the one 
here described. As I personally have no experience of Weinert’s instrument, and 
as his own particulars about the technique are scanty, I must refer to Hoadlcy 
and Pearson’s abovt‘-mentioned paper in Bionietrika, Speaking of Weinert’s 
publication in the Anthrop, Anzeiger {ZO), they say on p. 85: “No directions for 
the use of the instrument are provided with it nor in the above paper, only in the 
latter we find a diagram illustrating what is apparently the median sagittal plane 
of the skull with the instrument set for measuring the length. In this diagram 
the maximum foraminal length is almost as great as the distance from the basion 
to the tip of the dorsum sellae ; with such a magnitude of the foramen it is possible 
to place the hinge of the measuring circle (Weinert’s callipers) entirely within the 
foramen as in the diagram. With crania with more moderate foramina we have 
not found this possible, and greater difficulty may then arise in taking the internal 
measurements ; it is less easy to grope for the maximum diameter.” From what 
has been said it will appear that the advantages of the new instrument are the 
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be read off direct from the metrument, and (4) we therefore have leee difficulty in 



of ? the flexible steel bands have to be handled with a certain degree 

of caution. Especially, they must never be pushed forwanl with any great foie; 

[• IB this wholly in accord with the etatement in the laet lines of the previous pamgraph f Bn.] 



94 


Endocranial Diameters and Indices 


the ends should just barely touch the walls of the skull. The fear naturally existing, 
that the bands will bend on meeting a solid wall, proves to be quite unfounded*. 
Numerous control measurements made with a view to this latter possibility have 
shown that the instrument’s own limit of error is less than ^ mm. in a range of 
0 to 200 mm., and as regards our matei'ial — and for anthropological material alto- 
gether — greater dimensions are hardly likely to come into consideration. It is 
of course obvious that this error of the instrument itself — which may here be left 
out of account, since all measurements arc read off to the nearest millimetre — must 
not be confused with the actual error of measurement in determination of the 
diameters of the cranial cavity, where difficulties occjusioned by the peculiar nature 
of the material also come into consideration. As regards this point, I may refer 
to the last portion of the present section. 

Also as regards the technique of measurement in general we may in all essentials 
refer to Hoadley and Pearson’s article. Although this latter was, as stated, unknown 
to the author when the present material was being investigated, the mode of pro- 
cedure, so far as it appears from the directions given, has been in principle the 
same. Of the greatest importance is Hoadley and Pearson’s remark on pp. 86 — 87, 
that the maximum longitudinal diameter of the cavity does not lie in the median 
plane — of which we can easily convince ourselves by examining sectioned skulls — 
but about 1 cm. to the side thereof, so that the measuring points correspond to the 
most prominent parts of the frontal lobe or of the posterior lobe. “The reader who 
will examine a sectioned skull will find that the median sagittal section is ridged 
to a greater or less extent both anteriorly and posteriorly ; we have the crest for 
the attachment of the fixlx cerebri, the crista galli of the ethmoid, and the internal 
occipital protuberance and the sagittal ridge associated with it” (p. 86). For this 
reason it is practically impossible to take an internal longitudinal measurement in 
the median sagittal plane, and apart therefrom such a measurement would be 
of no interest, since the purpose of the measurements of the cranial cavity is 
to eliminate the influence of the bone thickness. But if we are thus obliged to 
abandon the obtaining of a longitudinal diameter in the median plane, we must 
on the other hand be careful constantly to have the points of contact for measure- 
ment lying strictly sagittally; in other words, the perpendicular distances from 
the points of contact to the median plane represented by the crista sagittalis 
must be equal in front and behind, so far as it is possible to secure this by 
estimation with the eye. To the above directions given by Hoadley and Pearson 
it should furthermore be added that we must always take care that the posterior 
point of measurement stands free of the furrow of the sinus sagittalis or the 
latter’s deflection into the sinus transversus. As the sinus transversus is generally 
more strongly developed on the right side than on the left, it would lead to our 

* In order to strengthen the bands the author got Mr Gundersen to make a new instrument where 
two flexible bands are riveted together on each side of the bar. This new instrument, which has not been 
used as yet, looks exactly like the former, but the bands are very much strengthened. It might perhaps 
be possible to combine three bands on each side, thus securing still more strength without reducing 
their flexibility. [If the “fear” be “quite unfounded” why this need to strengthen the bands? Bn.] 
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getting too large a mean value for the longitudinal diameter of the right side of 
the cavity if we were not observant of this source of error. For the sake of com^ 
pleteness it should further be mentioned that — as follows moreover as a corollary 
from what has been said above — the measuring points for the maadmum bread^ 
must lie in a transverse plane and, of course, be strictly symmetricar on the two 
sides. But otherwise we must also here — ^as in every maximum measurement- 
take into account the individual peculiarities of the skull. It will very soon be 
discovered that the me^ximum breadth of the cavity generally lies in a plane very 
near to the centre of the occipital condyles, a regularity which is probably not 
merely fortuitous but duo to mechanical factors associated with the upright manner 
of walking. (Cf. the anthropoids, in which the maximum breadth of the cranial 
cavity is constantly found considerably in front of this plane, provided the skull is 
orientated in the Frankfurt horizontal. See also Mollison’s work, Statik dea menach- 
lichen Schddela (18). An examination of prehistoric skulls with a view to this 
question would not be without interest.) 

It remains to bo mentioned that the two height measurements, internal bcCaion 
height and internal opiathion height^ have been found by aid of a vertically placed 
rod, gradtiated in millimetres, which is shown in Fig. 2 (p. 96). The skull is orientated 
in the ear-eye plane by means of Mollison’s craniophor. We are aware that the 
furrow of the sinus sagittalis is included in these measurements, but at this place 
the furrow is so slightly developed and probably varies so little that we may be 
permitted to disregard its influence. Of the two height measurements most im- 
portance is assigned to the first-mentioned, because it comes natural to compare 
it with the external measurement, basion-bregma height. With a view to the 
determination of capacity we have also calculated a third height, namely, internal 
ear-hregma heighty which has been found indirectly by subtracting from the 
external measurement the individually determined thickness of the os parietale. 
This latter value was, it is true, mejisured at the lower, posterior corner of the bone 
(right side and about 2 cm. above the sutura mastoidea) — through the foramen 
magnum by aid of ordinary callipers — and not at the bregma, but the difference 
is probably negligible. 

McUeriaL As to the material I need only say that it belongs to the Anatomical 
Institute in Oslo, the Principal of which. Professor K. E. Schreiner, has not only 
been so kind as to place the skulls at my disposal, but has even allowed me 
to make use of material assembled for another purpose. This latter concerns 
Norwegians and Lapps. A systematic treatment of the whole material of Lapp 
skulls, far more comprehensive than that dealt with here, now lies completed 
from Schreiner’s hand (82). Although I thus, from considerations of time, have 
not included all the skulls of Lapps and Norwegians that stood at my disposal, yet 
I have not made any selection, except in so far as only adult skulls in good 
preservation have been employed. The two groups consisting of Australians and 
Maori will form the foundation for another work by myself, already commenced, 
on the craniology of the Polynesians. 
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Originally the material consisted of 328 skulls of Norwegians, Lapps, Maori 
and Australian aborigines. After I had finished the study, I measured 41 Eskimos 
to check some of the more doubtful results. The means, the o-’s and t;’8, for all 



the measurements are given in Table XIII. I have also measured 10 anthropoid 
skulls, likewise belonging to the Anatomical Institute. These few skulls will 
serve to demonstrate that the same technique may be applied to this material. 
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It now remainfi to set forth a few fignres illustrating the noowmy of measure- 
ment, What is of interest here is the error of measurement in determination of the 
length of the cranial cavity, where the difficulties are greatest. As described in 
detail by Hoadley and Pearson (p. Ill), this error can be determined in the 
following manner : We take two sets of independent measurements and calculate 
i (Lj, + Lr) for both. If the individual observations in the first and second measure- 
ments are designated by zi and zt respectively, then x 0*67449 will give “the 
probable error of a single measurement/* 

In our own material, this value was found to be 0*7406 mm. (flr*»1098 mm.) 
for 324 skulls, since 4 skulls were not available for the second measurement, 
whereas Hoadley and Pearson find the error to be 1*2048 mm., with use of 
Weinert’s instrument. 

We see that the error of measurement is reduced considerably (by as much as 
38 %), which is not an improbably great reduction when we consider the advantages 
of our instrument, especially the fact that the points of measurement can be seen. 
With greater practice I regard it as possible to reduce the probable error to less 
than 0*6 mm. 


III. The Anthropological Importance op Internal Measurements and 
Indices. Thickness of the Roof of Skull. Variability and Correlation. 


As mentioned in the Introduction, our first object is to investigate to what 
extent the craniological differences of race expressed by the three main diameters, 
and the indices computed on the basis thereof, are accompanied by parallel 
differences in the form and size of the brain. Table I gives a survey of the most 
important absolute measurements. Here, as in the other tables, m denotes “standard 


error, i.e. 
indicated. 




Where “probable error,** i.e. 0*67449m, is used, this fact is specially 


Norwegians and Lapps are relatively well represented, while the values for 
Maori and Australians can only be taken as provisional, without this fact having 
necessarily any groat influence as to the questions here dealt with. It is seen that 
the difference in sex comes well into evidence, both in the external and in the 
internal values. The maximum length of the cranial cavity (length of the cerebrum) 
is in women from over 3 to about 7*6 mm. less than in men, while the breadth is 
from about 4*6 to about 6 mm. and the height from about 3 to 6 mm. less in 
women than in men. These differences are from 2 to 9 times their standard 
errors, only the difference in height of brain-box in Australians being statistically 
veiy doubtful (less than twice the standard error of difference). Further, it is clearly 
evident from the internal measurements that the Australians* brain is not only 
very narrow but also extremely short, while on the other hand it shows considerable 
height, not only relatively but also al^olutely regarded. Before proceeding to 
illustrate these points means of indices we shall see what Table I has to tell 
about the bone thickness in our 4 groups. Subject to the reservation that an 
Biometrika zzvn 7 






K. WAaNBE 


99 


indirect determination of bone thickness can jdeld only approximately correct 
figures^ and that the upper point of measurement for No. 6 does not correspond 
exactly to bregma (No. 3), | (2 — 5) will give the bone thickness at euryon and 
3 — 6 the approximate thickness at bregma. 

The values given below (Table II) are, however, the means of the individually 
computed differences (not the differences between the means) f, whereby it becomes 
possible to take the standard errors also into account. For bomparison with the 
computed bregma thickness, the thickness of the os parietale (right side) at the 
lower posterior comer is also given, measured directly through the foramen magnum. 


TABLE II. 



Norwegians 

Lapps 

Maori 

Australians 


i 

n=:81 

9 

n=76 

n=68 

? 

n-65 

6 . 
n.=16 

9 

»=14 

s 

n=18 

9 

nsslO 

Thiokness at 
euryon 

4-16 

±0-09 

3*57 

+0-10 

3*24 

±0-11 

3*41 

±0-12 

3*28 

±0*14 

3*04 

±0-20 

3*73 

±0*30 

3*50 

±0*22 

Thickness at 
bregma (computed) 

5*25 

±0-20 

4*32 

±0*18 

4*16 

±0*19 

3*44 

±0*23 

6-19 

±0*53 

4*64 

±0*47 

7*16 

±0*64 

4*80 

±0*77 

Thiokness of os 
l>arietale (direct) 

5-33 

±0*17 

4-38 

±0-13 

4*30 

±0*15 

3*98 

±0*14 

6*88 

±0*36 

6*79 

±0*32 

• 

7*39 

±0*66 

6*70 

±0*32 


We see that the difference of the directly and the indirectly determined thick- 
ness of the 08 parietale is in general small, being in 4 groups less than i mm. 
In Lapp women, Maori men, Maori women and Australian women the difference 
is, however, not inconsiderable. In the two latter groups the computed bregma 
thickness is probably about 1 mm. too small, which may easily be explained by some 
irregular contours, where immediately behind the bregma the cranial roof has risen 
relatively much. 

In the next place it is seen that the skull-cap is, as might be expected, some- 
what thicker at the bregma than at the euryon and that the male skulls are in 
general a little thicker than the female. As regards the racial differences revealed 
in Table II, it is natural to compare Lapps with Norwegians and Maori with 
Australians. As might a priori be expected, the values for Lapps are lower than 
the values for Norwegians and those for Maori in general lower than for Australians. 
The difference in the former case may be regarded as statistically certain. If we, 

* Owing to the small dimensions here involved we must, in an examination aiming directly at the 
thiokness of the skull roof, demand a greater aoouraoj of measurement than 1 mm., and measurements 
preferably taken with an instrument specially constructed for the purpose. Such an instrument was 
(according to Martin) designed by P^n as early as 1896. 

[t If the two are not the same, this must be a fault of arithmetic, not of measurement. Ed.] 

7—2 
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for example, compare the thickness at the euryon in male Lapps (3’24) and in male 
Norwegians (4*15), the difference is seen to be 0*91 mm., or 6*5 times the standard 
error of the difference (0*14). The difference between Maori and Australians, on 
the other hand, is statistically less well founded, but it can hardly be doubted that 
in a larger body of material the difference would be well beyond the statistical 
error. It would be of interest to ascertain whether the Australian skull, the 
thickness of which at the bregma is very considerable, is not thicker at the euryon 
than the skulls in an average European material, as seems to appear from Table II. 

Our figures accord well with what others have found : Hoad ley and Pearson 
state the thickness at the euryon to be 3*52 mm. and at the bregma* 5*46 mm. 
(male Egyptians). Todd (28), as the average of 448 sectioned “white crania” (male), 
found by direct measurement 3*56 mm. at the euryon and 5*88 mm. at the vertex, 
figures which, together with ours, confirm on the whole Beddoe’s table of the 
values that must be deducted from the external breadth and external height in 
order to arrive approximately at the internal dimensions (see Martin, p. 726). 
Only we must make the reservation that the variation — as Todd points out — is so 
considerable that the table can be applied only to mean values. 


We now come to the thickness of the frontal hone at the glabella. In order to 
find this value wo must from the difference between external and internal length 

deduct the thickness of the os occipitale at the posterior point of 
measurement • This latter dimension, which has been found directly, is 


likewise to be seen from the following table. (We have to disregard the fact that 
the anterior point of measurement for the internal length does not exactly corre- 
spond to the glabella-point from which the external length has been measured, 
but this circumstance will certainly have no disturbing influence on the results.) 


TABLE III. 



Norwegians 

Lapps 

Maori 

Australians 


? 

(? 

9 

<j 

9 , 


9 

Thickness of 

OH occipitale 

Thickness of os frontiile 
at glabella 

4-21 

±0-11 

11*22 

±0*33 

3-9U 

±0*13+ 

9*16 

±0*27 

3*01 

±0*13 

8*11 

±0*40 

3*06 

±0*14 

6*33 

±0*34 

4*89 

±0*24 

10*71 

±0*62 

4*54 

±0*25 

7*43 

±0*46 

6*19 

±0*39 

14*36 

±0*83 

6*42 

±0*31 

11*68 

±0*74 


[* Hoadley and Peareon measured the bone>thioknesB vertically above the basion, which is not the 
bregmatic thickness. En. 

t This is the value given in Table XIII, but if the entries in that table from which it is deduced be 
correct, it should be 9*28, which will modify the value for Norwegian - Lapps given in the following 
paragraph from 2*82 to 2*90. En.] 
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Tcdcing first the occipital bone, we again note that the Lapp skulle are the 
thinnest and the Australian skulls the thickest, although the racial difierences are 
not particularly pronounced. As before, the difference between Norwegians and 
Lapps is statistically well established, being for the male skulls 1*20 mm., or 
7 times the standard error of difference, while the difference between Maori and 
Australians is statistically doubtful. In males it is 0*30 mm. (mDif.— 0*48 mm.). 
Further there is seen to be a strikingly small difference between the two sexes. 
In Lapps and Australians the difference found actually goes in the opposite 
direction to what should be expected. Quite different is the situation as regards 
the glabella-thickness. Here the difference between the sexes goes in the same 
direction in all 4 groups, and lies between 1*78 and 3*28 mm., mj}ir, lying between 
0*43 and 1*11. The sexual difference is from 2 to 5 times the standard error of 
difference. And if we further compare Norwegians with Lapps and Australians 
with Maori, we find, to take the male skulls first, a difference between the former 
two of 3*11 mm., and between the latter two of 3*64 mm., that is to say, respectively, 
6 and 4 times mnif. (0*62 and 0*98). The female skulls give quite similar figures : 
Norwegians — Lapps 2*82 mm., or 6 times mDif. (0*43); Australians — Maori 4*16 mm., 
or 5 times minf, (0*87). My figures for the glabella- thickness thus also accord 
veiy well with earlier findings: Todd's material (male “white crania”) gave 
11*26 min. (cf. our Norwegians) and Hoadley and Pearson's male Egyptian skulls 
showed a difference between external and internal length of 16*10 mm., whereof 
the author's estimate ^ to be frontal and f occipital, that is to say, a glabella- 
thickness of about 9 mm. (lying between Lapp and Maori). 

The marked sexual and racial difference in the glabellar region which I have 
here demonstrated is not surprising. In the first place, a well-developed torus 
supraorbitalis is one of the features to which most importance is attached in 
determination of sex, and as regards the Australians, Mollison ( 17 ) has emphasised 
the great development of the glabella particularly in contrast to the Maori. The 
flat glabella in Lapps is likewise a well-known phenomenon, which moreover we 
have seen is accompanied by generally thin skull-bones. In connection with my 
investigations of the thickness of the skull-wall at the euryon and bregma we con, 
however, from the figures here given draw certain conclusions of interest for our 
main subject : the correlation between form of brain and of skull. 

I. Since on going over to internal measurement the length is reduced much 
more than the breadth and the breadth, again, more than the height, all three 
internal indices will probably on the average show larger values than the corre- 
sponding outer indices. The differences here concerned recur again in all 8 groups 
and they are so considerable that this rule may probably be deemed to apply, if 
not to all, at any rate to most races. The increased breadth-length index has 
repeatedly been observed in casts of the cranial cavity, cf. Schwalbe (as), who as 
early as 1899 could state, on the basis of examination of individual skulls: “Es 
zeigt sich, dass der Sdhfidelinnenraum des Erwachsenen, wie bei der Mehrzahl 
der Affen, einen grdsseren Index besitzt, als der aussen gemessene Schfidel” (S. 36). 



102 


Endocranial Diameters and Indices 


II. Even though the same tendency is present in all groups, it is not likely to 
make itself equally evident in all of them. In the races which show the greatest 
development of the glabella region the increase in the breadth-length and 
height-length indices will be most pronounced — in our material especially in the 
Australians. 

By a more searching examination of my figures it would undoubtedly be 
possible to draw other and more detailed conclusions as to the probable '‘brain- 
indices,” but we can attain the same object in a simpler manner by a direct 
comparison of external and internal indices, as in Table IV. 


TABLE IV. 



Norwegians 

Lapps 

Maori 

Australians 

External indiceH 









d 

9 

6 

? 

d 

9 

s 

9 


Brcadtli -lengt/h 
index 

75*88 

±0-33 

75*50 

±0*31 

81*90 

±0*52 

83*17 

±0*49 

73*87 

±0*73 

74*07 

±0*66 

71*38 

±0*51 

71*92 

±0*83 

Height-leiij^th 

index 

70-4G 

±0*40 

09-90 

±0*34 

72*0.5 

±0*39 

72-62 

±0*39 

73*15 

±0*40 

72*35 

±0*50 

74-21 

±0-64 

76*40 

±0*00 

Iloight-breadtli 

index 

1)3*08 
± 0*.58 

92*07 

±0*52 

88*19 

±0*55 

87*39 

±0-47 

99*23 

±1*22 

98*20 

±1*13 

103*99 

±1*22 

105-04 

±0-82 

Internal indioen 









Breadth-length 

index 

77 *50 
±0*37 

77*24 

±0*29 

83*46 

±0*50 

83*77 

±0*46 

77*12 

±0*71 

75*30 

±0-64 

75-35 

±0*02 

76*27 

±0*90 

Height-length 

index 

73*89 

±0*45 

73-84 

±0*39 

74*45 

±0*45 

74*72 

±0*42 

75*94 

±0-73 

74*74 1 
±0*52 

78-07 

±0*90 

80*01 

±1*21 

H eight-breadth 
index 

94-82 

±0*64 

94-48 

±0-64 

89*40 

±0*60 

89*37 

±0*51 

99*28 

±1*22 

99*30 

±1*14 

104*48 

±1*48 

107*06 

±0-71 


On passing to internal measurement the breadth-length index is increased 
by from 0’60 to 3*97 index-units, the height-length index by from 2*10 to 5*21 
and the height-breadth index from 0*05 to 2*01 index-units. These figures accord 
very well with what might be expected from our knowledge as to skull thickness. 
As regards the last-named index the increase is sometimes insignificant, and in 
individual cases there will of course frequently be found a deviation in the 
opposite direction, but for the material as a whole the increase must, in spite of 
the relatively large mean errors, be regarded as a fact, since the same tendency is 
met with in all 8 groups. Meanwhile, if we wish to examine whether the anthro- 
pological differences in shape are lessened or further increased on passing to 
internal measurements, it is chiefly the first two indices that will be of importance. 
The breadth -length index has in the highly dolichocephalic Australians increased 
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by 3*97 and 3*35 index-units for males and females respectively, in the Maori by 
respectively 3*25 and 1*29 index-units, in Norwegians (mesocephalic on the border of 
dolichocephaly) by 1*68 and 1*68 index-units, and in the brachycephalic Lapps by <Mily 
1*56 and 0*60 units respectively for males and females. Herein we see a distinct 
tendency to reduce the racial differences, once again as a result of the formation 
of the glabella, — the increase being also as a rule greater in males than in females. 

As regards the height-length index the figures suggest a somewhat different 
situation. Named in order of succession fix»m the partly hypsicephalic Australians 
to Norwegians on the border of chamaecephaly the following values are found for 
the index increase: Australians respectively 4*46 and 5*21, Maori 2*79 and 2*39, 
Lapps 2*40 and 2*10, Norwegians 3*43 and 8*88. The racial difference between 
Norwegians and Lapps is lessened on adopting internal measurement, while as 
between Australians and Maori it is increased. The strongly developed glabella of 
the Australians thus partially conceals the considerable relative development in 
height of brain. Had our material included a chamaecephalic type with fiat glabella 
(if such a type exists), as well as a hypsicephalic group with strongly developed 
glabella, the increase in racial difference would have become still more clearly 
manifest. 

The results of these investigations of shape of skull as contrasted with shape 
of brain* can thus be provisionally summed up as follows: 

1. All three brain-indices have shown increase as compared with the indices for 
the skull. The difference is greatest in the height-length index, less in the 
breadth-length index and least in the height- breadth index. 

2. The racial differences are constantly less clearly manifest in the breadth- 
length index of the brain thaA in that of the skull. The general validity of this 
rule is of coui*se contingent on a rather strong negative (spurious) correlation 
between glabella-thickness and external length-breadth index. In my own material 
I have found the following figures for the coefficient of correlation (r) between 
the properties mentioned: 

Norwegians, male, — 0*2960 ± 0*101 ; female, — 0*2126 ± 0*110. 

Lapps, male, — 0*2856 ± 0*117 ; female, —0*2621 ± 0*126. 
Interracially this negative correlation will undoubtedly be still more pronounced. 

3. On comparing a hypsicephalic race having a relatively strongly developed 
glabella with a chamaecephalic race with relatively flat glabella, the height-length 
index for the brain will show greater racial differences than the skull index. 

Rules 1 and 3 need no further substantiation from our knowledge of the thickness 
of the skull roof at the glabella, euryon and bregma. 

If we are to have any hope of penetrating more deeply into our problem of the 
relation between shape of skull and shape of brain, we must also take into considera- 

^ Let it be mentioned agiiin — to avoid misaaderstandinga — that ** shape of brain** is here used as 
synonymous with shape of skull cavity. 
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tion the conditions of variation and correlation. What is the chief interest for us 
here is to compare the distributions of the external with the corresponding internal 
dimensions (Table V). 

Hoadley and Pearson find that the internal length is "considerably more variable 
than the external length, while in the case of the breadths and heights the 
difference in the coefficients is negligible, being well within the probable error/' 
In our material there is found no conformity to rule in this respect; sometimes 
the internal length, sometimes the internal breadth or height is more variable than 
the external. The difference as a rule is slight, and for each group taken separately 
does not offer much to build upon (cf. the "standard error” of standard deviation), 
but such difference as is found generally tends in the direction that the internal 
measurement is more variable. If we keep to the variation coefficients and count 
together the 4* deviations for the brain, we find that 32 out of 48 point in the 
direction of the brain being more variable and 16 out of 48 in the opposite direction, 
while 1 shows no difference. On examining the absolute measurements and the 
indices separately the following figures are obtained: 15 +, 8 — and one 0 deviation 
for the absolute measurements, and 17 -f and 7 — for the indices, and of these latter 
7 — deviations five fall to the internal breadth-length index, while the height-length 
index of the brain is in all eight groups more variable than that of the skull. In all 
cases it can be concluded from these figures that at any rate we must reckon with 
an (intraracial) variation of shape and size of the brain which is not less than that 
of the skull. This result is surprising in so far as we should have expected the 
considerable variation in thickness of the skull to make itself strongly felt. An 
investigation of the correlation between internal measurement and bone thickness 
supplies the explanation; for Hoadley and Pearson have shown that the bone 
thickness is negatively correlated with the corresponding internal measurement, 

» thus giving — 0*36448. A similar correlation is found in our material. 

As appears from Table VI (p. 106) the thickness as represented by the 
glabella-thickness is in all four groups negatively correlated with the length of 
the cranial cavity. The accompanying errors are, it is true, so large that we must 
content ourselves with a summary consideration, but for the material as a whole 
hardly any doubt can prevail as to the significance of this negative correlation, 
especially in view of Hoadley and Pearson's investigations. We also find that the 
euryon-thickness in two groups is negatively correlated with the internal breadth, 
while in two groups the correlation is without significance. 

The most natural explanation of these negative correlations is the following: 
either (a) the roof of the skull exerts an inhibitive influence on the growth of the 
brain, or (6), conversely, a relatively considerable growth of the brain will have a 
reducing effect on the thickness of the cranial roof, or (c) both fitetors come into 
play at the same time. It is not xK>ssible on the basis of the available data to 
penetrate more deeply into the causative relations. And neither is an investigation 
of the correlation between bone thickness and the three main indices of the brain 
particularly illuminating in this respect. As is seen, all these correlations (bone 
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TABLE VI. CorrelatimiH, 



Norwegians 

Lapps 

Corrolatod Pair 


? 

d 

$ 


«^81 

71 = 70 

n=63 

71 = 66 

(flttl>ella> thickness & internal length 

- *344.1 + *098 

- •2961 ± *105 

-•2354 ±-119 

-*1467 ±-132 

Glabella-thickness & internal breadth 

- *2442+ *104 

-•2407 + *108 

-*2283±-119 

- *0761 ± *134 

Glabella- thickness & internal basion -height 

-*0431 fill 

- -1605 ±*11 2 

+ -0168+ -126 

-•0868 ±-134 

Euryon-thickness & intornal breadth 

+ *(X)79±*111 

-•2145±*110 

-*1617±*123 

-•0762 ±*134 

(llalHilla-thitikness & internal breadth -length index 

-f- *0870 +*102 

+ *1027±*113 

+ •0435 ±*126 

+ •0743 ±*134 

Glabella-thickness & internal height-length iinlex 

4-*1791 + *108 

+ •0421 + *115 

+ -2108±*120 

+ •0396 + *135 

Glabella-thickness & internal height-brimdth index 

+ *0942 ±*110 

+ •0236 ±-115 

+ *1521 ±-123 

-•0304 ±*1.35 

Euryon-thiokncss & internal breadth -length index 

-•0486 ±*111 

- *0284 ±-115 

+ -0694+ *125 

-•0097 ±*136 

Internal breadth-length index & capacity 

-•0357 + *111 

-*2879 + -105 

-•1319±*124 

-•0068 ±*136 

Internal height-longtli index & capaenty 

+ *0216±-111 

-•2227+*l()9 

-•22.38 ±-120 

-•0410 ±-135 

Internal height-bre;idth index & capacity 

+ ■1393 ±-109 

-•0434 ±*115 

- -0216 + -126 

-•0761 ±*134 

Internal length & internal breadth 

+ -41]8±*092 

+ •4886 ±-087 

+ -1318 + -124 

+ *0.540 ±*134 

Internal length & internal height 

+ •2303 + -105 

+ -.3503+ -UX) 

+ *21.35 + *120 

+ 1269±*133 

Internal breadth & internal height 

+ *1083±-110 

+ •0597 ±-114 

+ *0276±*126 

+ -0724±*1.34 


thickness being represented at glabella and eiiryon) are very slight, only three 
of them exceeding 2w,.. Neither has an examination of the correlation between 
brain-shape and capacity led to positive results. It is true that the correlation 
between capacity and breadth -length index is negative in all four groups, but only 
in one case, Norwegian females, is it of statistical value in relation to We must 
content ourselves with the negative conclusion that, judging from the material 
here dealt withy neither the cranial capacity nor the thickness of the roof of the 
skull has any essential predetermining influence on the form of the brain. Mean- 
while we must bear in mind that an investigation of intra-racial correlations in a 
comparatively small body of material, such as mine, cannot be decisive as regards 
this question. I may mention in this connection that my Australian skulls were 
strikingly thin at the euryon, and the same was the case with the 41 Eskimo skulls 
(18 male, 23 female) which were examined after the main material had been dealt 
with (see Table XIII, no. 6 (c)). As we know, the Eskimo skulls are also excessively 
dolichocephalic. If it should prove to be the case that we are here confronted by a 
general law, namely, that the skulls of the dolichocephalic races are relatively thin 
at the euryon, then it must be permitted to draw the conclusion that a pressure from 
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the side wall of the skull is an essential feu^tor in the shaping of the dolichocephalic 
brain. This conclusion is of all the more interest because it diverges greatly from 
the generally adopted answer to a much disputed question. 

The last three series of coefBcients for the relation between the three main 
diameters of the cranial cavity likewise confirm very well Hoadley and Pearson’s 
results. There is a veiy strong positive correlation between maximum length and 
maximum breadth and likewise between maximum length and internal basion- 
height, but between breadth and height the correlation is practically zero. The 
positive correlation between the two first-named values suggests the presence of 
general factors of size (genetic or epistatic) afiecting both diameters. From the 
fact that the correlation is by no means absolute we may conclude that there 
exist at the same time special shape-determining factors which act more strongly on 
one diameter than on the other, and such shape-determining factors create an 
antithesis in the relation between height and breadth of brain which is here 
sufficiently strong to mask the effect of the general size- factors. 

IV, Shape of the Brain in Prehistoric Man and in the Anthropoids. 

It is natural to compare the above figures for the three main indices of the brain 
with the values previously found from casts of the cranial cavity in prehistoric 
specimens. As early as 1899 Schwalbe (28) compared the internal breadth-length 
index in the Pithecanthropus, Neanderthal -Kalotte, Spy I and Spy II with that of 
some isolated recent skulls. None of these prehistoric specimens showed exceptional 
values: Pithecanthropus 80*0 (in measurements by Dubois and Weinert (81)), 
Neanderthal-Kalotte 79T (after other measurements 78*2 — 80*0), Spy I 76*9, 
Spy II 81*3*. Further can be mentioned: La Chapelle-aux-Saints 78*3 (Boule and 
Anthony (4), p. 134), Rhodesia 79*4 (Hrdli6ka(ii), p. 121), Gibraltar 79*0 (Hrdlidka, 
p. 169, "close to”), Ehringsdorf 79*0 (HrdlidSka, p. 239, after Weidenreich’s "recon- 
structed 1926 skull”), LaQuina 73*8 (Hrdli6ka, p. 291), *Le Moustier 76*7 (Hrdli6ka, 
p. 301, after Weinerb’s reconstruction “close to*^). 

Of the prehistoric finds here mentioned only the La Qnina has had a relatively 
narrow brain as compared with our recent material. All the other figures lie above 
the average for our Australians and not far from the mean values for recent 
mesocephalic Europeans. 

Schwalbe also established the fact that the brain of the anthropoids was 
relatively broad, and the figures given by him for breadth-length index have on 
the whole been confirmed by later investigators: Chimpanzee 86*6, Gorilla ^ 84*6, 
Orang J 96*0. Unfortunately Weil’s (89) technique is so different from Schwalbe’s 
and ours that we cannot make use of his measurement of “ endocranial casts,” and 
Connolly’s (7) excellent paper deals with fixed brains, but it can nevertheless be 
regarded as established that the orang-outang is farthest removed from man as regards 
this index, next comes the chimpanzee, while in the gorilla the index partly coincides 

* It is not quite clear from Schwalbe’s work whether the two Spy figures have been based on oasts of 
the cranial cavity. At any rate, according to Schwalbe they give ** die Innenform des Schiidels.” 
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with the mean figures for recent and prehistoric man. The considerable variation 
found especially in the gorilla (different species ?) has been pointed out by several 
investigators, Keith (18), Bolk (2), Harrises). Keith (p. 78) gives the following figures 
for endocranial breadth-length index: 

Chimpanzee 84*5, ? 84T (var. 79 — 92, w = 19). 

Gorilla ^ 78-8, ? 81*3 (var. 72 - 88, « * 47). 

In our own material we have found the following values: 

Chimpanzee: 81*9. 

Gorilla : 79*5 - 82*8 ~ 84*2 - 82*4 - 82*2. 


Orang: 85*7 - 90-2 - 87*8 - 84T. 

None of the three groat anthropoids lies entirely outside of the range of variation 
for man I 

With respect to relative height, it is a well-known fact that most prehistoric 
skulls have a height-length index which is far below that of recent human skulls 
and in a manner forms a bridge between homo sapiens and the great anthropoids. 
Here, however, we still lack reliable figures which could give us an idea of the 
situation as regards the brain. 

For the La Ckapelle-aux- Saints skull Boule and Anthony give an internal 
height-length index of 

( hauteur basilo-bregmatique x 100\ _ ^ 

longueur max. / 

Rhodesia, 70*6 (computed from Hrdlidka’s figures: endocranial basion-bregma- 
height 120, maximum length of cerebral hemisphere 170). 

Gibraltar, 70*6 (“close to,*' computed from Hrdlicka’s figures for greatest 
length of brain 163 and biisi on-bregma height less thickness of frontal bone near 
bregma 116). 


Le Moustier. 75*0 (after measurements of Weinert^s reconstructed model skull, 
internal length 176, internal basion-height 132). 

It must be deemed surprising that these values also lie within the range of 
variation for our recent material. As regards the Chapelle-aux-Saints skull the index 
is, no doubt, very low, but no fewer than 6 of our 81 male Norwegian skulls show 
still lower values. Neither can the internal height-breadth index be said to be 
exceptional in the above-mentioned skulls: La Chapelle-aux-Saints 86*9, Rhodesia 
88*9, Gibraltar 89*8, Le Moustier 97*8. In this respect we find that 4 of the 
aforesaid 81 recent crania lie below the lowest figure, that for the La Chapelle-aux- 
Saints. 

As regards the anthropoids Connolly *8 measurements of fixed brains have shown 
that the difference in height-length index between chimpanzee, gorilla and orang 
is not particularly great ; 


5 male chimpanzees 

36*1 

6 male gorillas 

38*1 

13 male orangs 

36*5 
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But these fibres do not of course admit of direct comparison with ours, where 
the technique is quite different. I’or our own (unfortunately scanty) anthropoid 
material we have found the following values for internal height-length index or 

length of BLll-cavity ) ' ’^‘“8 orientated m the Frankfurt plane : 


(i 


Chimpanzee 69*8 

Gorilla 73*2, 776, 71*7, 664, 746 

Orang 79*0, 838, 81*6, 72*9 


All these figures lie within the range of variation for our recent human material! 
We see how necessary it is also in case of the anthropoids to procure a large body 
of material for comparison before we can utilise the general shape of the brain 
for elucidation of the phylogeny. I consider it futile to proceed to speculations 
respecting the phylogeny on the basis of one or more indices, so long as we have 
not a large body of material at our disposal and so long as we have not through 
investigations of correlation secured a more certain basis for correct evaluation 
of the individual figures. So far as my figures reach, it is permissible to conclude 
that most prehistoric men, and especially those hitherto best investigated, had 
a mesocephalic brain with a breadth-length index which does not differ mate- 
rially from that found in many recent races of man, while the height-length 
index was probably considerably under the mean values for recent races. This 
increased development in relative height of the brain will accordingly be the real 
problem so far as regards shape. With respect to the anthropoids, my in- 
vestigations have confirmed the prevailing assumption of brachycephaly, especially 
for the orang-outangs, whereas we find to our surprise that the height-length 
index is not particularly low, provided the technique employed is kept constantly 
the same. Investigation of a large body of material is here highly necessary, in 
order to ascertain the mean figures for the different species of anthropoids. 


Furthermore we can assert that the difference in general shape of brain between 
anthropoids and prehistoric and recent races of men is not nearly so great as cranio- 
logical investigations had hitherto led us to suppose, but it is possible that future 
investigations will render it justifiable to attach very great importance to the 
difference that is found to exist. It will contribute very much to a more precise 
definition of the problems if we consider brain-form and skull-form to some extent 
separately. It must be due to a confusion between the two — or perhaps rather to 
deficiency in the figures — when Bolk (8) designates “ the primitive races of man,*' 
including the Homo Neanderthalensis, as “dolichocephalic," while at the same 
time the primates are stated to be “ generally brachycephalic." 


If we consider the shape of the skull cavity — and otherwise we shall not be 
able to draw comparisons — then the primates are certainly brachycephalic as com- 
pared with man, but most of the prehistoric human skulls which have hitherto 
been examined must be deemed to be mesocephalic and not dolichocephalic as 
stated by Bolk. 
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V. ENDOCRANrAL ASYMMETRY. 

It might a priori be considered a comparatively simple matter to ascertain 
which hemisphere was the heavier, or which showed the greater development in 
length) but nevertheless Hoadley and Pearson quote a large number of papers 
dealing with this problem without its being possible to say that their excellent 
survey of the literature makes the matter very much clearer. Boyd, Broca, Topi- 
nard, Ogle, Inglessis have published data pointing to “ pre-eminence of the left 
hemisphere.” Investigations made by Thurnam, Wagner, Braune and Wilde 
point in the opposite direction, while Bastian and Reichard leave the question 
unanswered, the former after having occupied himself therewith for a number 
of years. 

According to Braune and Wilde the greater development of the right cerebral 
hemisphere is associated with a preponderance of the left cerebellar hemisphere. 
According to Inglessis the brain is “asymmetrical, but this asymmetry is confined 
chiefly to the occipital part of the hemispheres, and here the left hemisphere pre- 
dominates” (Hoadley and Pearson, p. 106), in strong contrast to Broca, who found 
the preponderance of the left side to be confined to the frontal lobe (cf. the speech 
centre). 

In the following we shall mainly compare our results with those of Hoadley 
and Pearson, whose technique is the same as ours. 

As average of 729 tnale Egyptian skulls, they find 

171*0446, <7 *5-9551, 

1^*170-0501, 0-* 5*6908, 

that is to say, a difference of 0*9945 mm. in the length of the two hemispheres, 
with preponderance on the right side, a difference which, as the authors show, 
cannot be due either to “error of measurement” or to “random sampling.” “In 
other words, if we may take our sample of over 700 male Egyptian skulls as 
representative, the right hemisphere is in the main of greater length, and therefore 
probably of greater capacity than the left”(p. 111). As stated in the Introduction 
our figures point in quite the opposite direction : 

TABLE VII (a). 



Norwegiann 

Lapps 

Maori 

Australians 

i 

n=:81 

9 

n=76 

<? 

n = 63 

9 

n = 66 

c? 

71 ssl6 

9 

n=14 

n = 18 

9 

ns 10 

Lr 

171*94 

166*78 

167*84 

161*42 

170*31 

167*67 

163*30 

166-90 

Lx. 

172-64 

166*46 

168*25 

161*71 

172*13 

168-60 

164*15 

166*90 

Li.-LR±mnf_ 

0*60 

0*68 

0*41 

0*29 

1*82 

0*93 

0*85 

1*00 


±0*194 

±0*154 

±0*159 

±0*187 

+ 0*385 

±0*416 

±0*359 

±0*379 

Ll~Lr ± probable error 

0*60 

0*68 

0*41 

0*29 

1*82 

0*93 

0*86 

1*00 


+ 0*131 

±0*104 

±0*107 

±0*126 

±0*260 

±0*280 

±0*242 

±0*266 
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In all 8 groupB the left half of the cranial cavity is on the average somewhat 
longer than the right. The differences are from 2*3 times to 7'0 times their 
probable errors, using the formula 
0*67449 

The differences are thus without doubt statistically reliable, not only for the 
material as a whole, but also for most of the separate groups. The correlations 
are obviously rather close : 

Norwegians : 4* 0*961, Norwegians ? : + 0*974. Lapps : -f 0*983, Lapps ? : + 0*957. 
Maori ^ : 4- 0*944, Maori ? : 4- 0*937. Australians 4- 0*946, Australians ? : 4 0*979. 

On measuring a second time the following figures were found, which we give 
in parentheses in order to avoid confusion : 

TABLE VII (6). 



Norwegians 

Lapps 

Maori 

Australians 

6 

9 

6 

9 

<? 

9 

(? 

9 

Ll 

(171-83) 

(172-65) 

(0-82) 





(167-29) 

(168-21) 

(0-92) 

(163-68) 

(163-76) 

(0-17) 

(155*90) 

(166-80) 

(0-90) 


We see that the concordance with the first set of measurements is very good. 
Not only do we find again in all groups the greater length of the left half of the 
cranial cavity, but there is also an unmistakable correlation between the lateral 
deviations in the first and the second set of measurements. Only in the small 
group of female Australians is the difference materially reduced, and here one 
skull was lacking at the last measurement. The conclusion we can derive from 
our material as compared with Hoadley and Pearson*s therefore is that the asym- 
metry of the two hemispheres as regards length is not, as has hitherto been supposed, 
a characteristic for man as a species, hut varies from group to group, probably with 
preponderance of the left side in most cases^. This view is confirmed by certain 
other investigations which Hoadley and Pearson do not mention in their survey 
of the literature. Thus Hrdli5ka (lO), who has measured the length of the three 
cerebral fossae on sectioned skulls, found the sum of the lengths to be greater for 
the left side : " In seven of the ten series of human adults the average of the sum 
of the lengths of the three cerebral fossae on the left exceeds that of the right 

* It is here aBsamed that Hoadley and Pearson’s technique corresponds exactly to ours. If we omit 
to light up the cranial cavity and to inspect the posterior point of measurement, this latter will sometimes 
fall in the furrow for the sinus transversus, which is known to be more developed on the right side. Of. 
in this connection what is said later about the correlation between development of the fossa occipitalis 
sup. , depth of the sinus transversus and length of the hemispheres. Of. also especially Elliot Smith’s 
article (B6), which also actually concerns Egyptian material. 
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side by from 1 to 3 mm. There is therefore a clear, though small, excess of fofi»al 
length on the left in the mass of human adult crania” (p. 199). This excess on 
the loft side the author finds to be due to the greater length of the left posterior 
fossa (posterior lobe), just as Inglessis found in hardened brains. Cf. Hrdli6ka, 
p. 193: “The most striking feature of the above data is the evidence that in all 
the groups and series the left (posterior) fossa is the longer, which is the reverse 
of what was observed with the middle and especially with the anterior cavities.'* 
The author does not seem to have determined the error of measurement, nor does 
he add a and to his mean figures (the article was published as early as 1907), 
but the material is comparatively so large (92 skulls) that it must be said to have 
a certain weight in this connection. A paper recently published by Connolly ( 7 ) 
points in the same direction. On 120 hardened brains the lengths of the two 
hemispheres were measured by aid of an apparatus designed by the author himself 
and the accuracy of the results was checked by renewed measurements, partly 
with change of technique. The material comprised Negroes (37 males, 9 females), 
Germans (36 males, 10 females), miscellaneous Whites (18 males) and Philippines 
(10 males). In 4 of the 6 groups (100 cases) the left hemisphere was from 1 to 
3 mm. longer than the right, and only 1 group (10 cases) showed a difference 
of 1 mm. in the opposite direction. Of the whole material the right hemisphere 
was longer in 31*7 %, the left was longer in 60'0%, and both hemispheres equally 
long in 18‘3 % (p. 483). My 41 Eskimos are in accordance with the rule : 
18 males In ^ 176 06, 177*00, thus giving lx, — 0*94 mm.; and 23 females 

Ijj = 167*26, Zjo — 168*22, thus giving — Ir == 0*96 mm. 

All in all, Hrdli^ka’s, Connolly's and my own investigations, together embracing 
581 cases, must be said to afford adequate proof of the greater length of the left 
hemisphere in certain groups of the human population. But it would of course be 
premature to draw from this the conclusion that the left half of the brain has in 
the same groups been heavier than the right. In measuring the greatest breadth 
of the cranial cavity it was strikingly noticeable that the distance from the median 
line — this latter being determined by the eye, with the longitudinal diameter of 
foramen magnum and the palatine suture as guiding line — was generally greater 
on the right than on the left side. Owing to the rough and ready determination 
of the median plane the figures given below make no claim to particularly great 
accuracy, but they are recorded here as a first attempt to determine the trans- 
verse asymmetry of the cranial cavity. 

— Z, maximum distance from the “median line” of the cranial cavity : 
Norwegians ^ (81) Norwegians ? (76) Lapps ^ (63) Lapps J (55) 

3*12 mm. 1*76 mm. 2*16 mm. 1*73 mm. 

Maori </* (16) Maori ? (14) Australians ^ (13) Australians ? (10) 

1*06 mm. 1*93 mm. 2*31 mm. 2*80 mm. 

If we are allowed to assume that the median plane of the skull cavity — as 
determined above — corresponds to the falx cerebri, or what comes to the same, 
that the medial surfaces of the cerebral hemispheres are nearly symmetrical to this 
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plane, then the above-mentioned figures go to show that the right hemisphere has 
been the broader in these groups. 


In order to verify these observations I have measured the corresponding 
maximum distance from the median plane on horizontal contours through the 
glabella and upper edge of the orbit. As can easily be ascertained, a contour at 
this level will go very near to the eoiyon, and a possible deviation will in any case 
not be of importance for our problem. Such contours of 166 Lapp skulls (91 male 
and 75 female) were placed at my disposal by Professor Schreiner. They had teen 
prepared for another purpose after P. and F. Sarasin’s system by aid of Martin’s 
Kubus-craniophor and diagraph. The following figures show that the same excess 
on the right side again appears. The measurements have been taken with an 
exactitude of mm. 

Medea Females 


Standard error 
R «= 75*17 ± 0*32 mm. 

I = 74*20 ± 0*28 mm. 
R-Z= 0-97 ± 0-238 mm • 


Standard error 

S « 73*10 ± 0*36 mm. ^ 
I 71*33 ± 0*32 mm. 
1*77 ± 0*290 mm.o 


The distance out to the right side is both for males and females greater than 
out to the left. In males the difference is 4mdif. and in females If the 

skulls in which the right side is the broader are designated by +, we get among 
the 91 males, 52 +, 29— and 10 zero-deviations; in the 75 females, 61 -h, 11 — and 
13 zero-deviations. Of the total diagram -material a -h deviation to the right is found 
in 62%, to the left in 24%, and no demonstrable difference in 14%. Technical 
errors may of course render the above results illusory, but the probability thereof 
becomes less according as the material is larger and more data can be obtained 
pointing in the same direction. It is natural to compare my findings with the type 
contours which have been published in Biometrika since 1911. As regards the 
technique we may refer to Benington and Pearson’s article : " Cranial Type Contours,” 
Vol. viii (l)t- On glabella horizontal sections the parallel No. 7 will give the most 
adequate expression for the greatest breadth of the skull and a comparison of the 
distance from the median line on each side affords very good information as to the 
average transverse asymmetry. Unfortunately some of the series are rather small, 
and if we examine n which gives the number of skulls, we see that two figures are 
sometimes given for one series. Obviously a single skull of the series in question 
has been defective with the result that the right or left side could not be measured. 
If the series be small, this circumstance will render it useless for our purpose. 
I have therefore in Table VIII omitted the series consisting of less than 10 skulls, 
in cases where the right or left side was not measured on one or more of the skulls. 


[* If the oorrehitionB between R and L are anything like those shown on p. Ill, these probable 
errors are curiously high. Ed. 

t The reader should examine this carefully as much turns on the manner in which the median 
lines are found in the horizontal and transverse contours. Yon Bonin’s technique is not that of the 
Biometric Laboratory. Ed.] 

Biometrika zxvn 
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Altogether the table embraces 42 series. These show a + deviation to the 
right in 32 cases and to the left in 10 cases, and on adding our own investigations 
of the dimensions of the cranial cavity the numbers will be 40 and 10 re- 
spectively, as we here disregard the croniograms of our Lapp material, since 
this has in part been included in the 40. Moreover, as already mentioned, these 
craniograms also proved to be in accordance with the rule. Of course, the distri- 
bution in some of the smaller groups will be fortuitous, while technical errors 
undoubtedly may be responsible for some of the minor deviations (Congo Batetelu, 
Eskimo, Morant*s and Woo’s Egyptian crania, Ban tarn -Batavia and Mittel Java, 
as w'ell as Negro crania from Teita Hills), but for the larger series with average 
difference in breadth of more than 1 mm. it is very improbable that this source of 
error has played any important rdle. This assumption can easily be confirmed by 
comparison with the corresponding “ transverse vertical contours ” which are 
likewise shown in Table VIII. It is seen that on these latter contours 4* deviation 
in 27 cases goes to the right, in 14 cases to the left, while in 1 case the deviation 
is 0. If we confine the examination to the groups in which the number of skulls 
ecpials or exceeds 20 and where the excess in breadth on one side is equal to or 
more than 1 mm. on transverse contours, we find that in 14 out of 15 cases the 
excess is on the right side and only in 1 case on the left, which altogether is a 
good proof that the technical errors have not been so great that the rule (the 
preponderating + deviation on the right side) loses its general validity, even though 
we cannot venture to attach any decisive weight in this question to the individual 
investigations. It must therefore be said to be of minor importance that 37 Basque 
crania or a single group of 31 Eskimo skulls show + deviation to the left, but it is 
of greater interest to note that out of 9 English groups there is only 1 (Hythe 
female crania) which on horizontal contours diverges from the rule. Or, if we 
prefer also to take into account the transverse contours, we find that of 18 devia- 
tions 16 go to the right and only 2 to the left (male Anglo-Saxons, transverse 
contours, and female Hythe crania, horizontal contours). Of Egyptian material 
we have 6 series which without exception, both on transverse and on horizontal 
contours, are in accordance with the rule, whereas the Oriental material (Burmese, 
Tibetan A and B, Nepalese, Hindu) strikingly often show a -I- deviation to the 
left. I thus obtain confirmation for my earlier expressed assumption that cerebral 
asymmetiy is probably a racially determined characteristic (associated with the 
quantitative proportion of left-handedness in the population ?), although for hitherto 
investigated European material it can be regarded as a general rule that the left 
hemisphere is the longer and the right side of the cranial cavity is the broader*. 

In this connection I must mention a paper published by Woo ( 84 ) which to 
some extent confirms my findings. Woo heis, like Hoadley and Pearson, investigated 
a very large body of Egyptian material — 887 male skulls from the 26th to the 
30th Dynasty — with a special view to asymmetry conditions. His technique differs 

* The plausible oonjecture that the greater breadth of one side represented a compensation for the 
greater length of the other has not been confirmed by our material. The correlation [Z.ji - and - Bx,] 
is practically equal to 0 and is in any case without statistical significance. {r=s - 0*060^^0*065.) 
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very inuch from mine, as only external measurements were taken and then chiefly 
confined to the individual bones. While paying full regard to possible sources of 
error, the author concludes that a preponderance on the right side is more frequent 
than on the left. Out of 25 measurements 16 show a + deviation to the right 
and, as appears from his tables, this excess is most pronounced for the os parietale 
and the os frontale. On the other hand it is noteworthy that the occipital arc from 
the lambda to the asterion “ is very significantly greater on the left,’* and although 
the two other occipital measurements give contrary results, it therefore seems 
rather premature when the author on the basis of the excess on the right side in 
the frontal and parietal region draws the conclusion that the right hemisphere is 
more strongly developed (p. 332)*. To my former conclusion I shall here merely 
add that the question of a general preponderance of the right or of the left 
hemisphere can hardly be decided by cranial measurements, and in carrying out 
direct weighing — which will certainly be adopted very largely also in the future — 
the anthropological character of the material ought not to be entirely disregarded. 
It would be of great interest to examine the larger lobes more systematically than 
has hitherto been the case, and judging from my material I should expect to 
find right-sided preponderance of the parietal and left-sided preponderance of the 
occijfital lohCf here again with certain reservations for anthropological reasons. 

Relation between fossa occipitalis sup., sinus trans versus dexter et sinister 
and length of the hemispheres. 

So far as can be seen, Elliot Smith was the first to point out that the upper 
posterior fossa, which so clearly marks off the caudal pole of the posterior lobe of 
the brain, is in general deeper and more fully formed on the left side than on the 
right (26). It wjis natural to connect this finding with the main outflow of the 
venous blood through the right sinus transversus. As mentioned in most text- 
books on anatomy and as can easily be ascertained on inspecting a series of crania, 
the furrow is more strongly developed on the right side than on the left — Le 
Double (16) — while the foramen jugulare also shows distinct difference on the two 
sides. I must agree with Elliot Smith that it is natural to suppose — but only 
as a provisional hypothesis — that the greater development of the left posterior 
lobe is the cause of the main outflow of the venous blood being to the right. And 
the next step in the process of deduction really follows of itself: the greater 
development of the left posterior lobe is in some way or other connected with the 
predominating use of the right hand, so that by examining the fossa occipitalis 
superior and the sinus transversus in prehistoric man and in anthropoids we can 
obtain interesting information as to the phylogenetic development of “right- 
handedness.” Elliot Smith has later reverted to this question on several occasions, 
for instance in an article on “Right and Left-handedness in Primitive Men” ( 26 ), 
where it is stated, inter alia, that the Pithecanthropus skull-cap and the London 
skull both show inverse occipital asymmetry, for which reason both of them are 

[* It is in the frontal and parietal regions on the left side that functional centres have been authorita- 
tively placed and are asserted to give predominance in weight and size to that side. En.] 
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thought to have belonged to left-handed individuals. Here, however, we already 
meet with the first difficulties, since it is found that the sinus transversua in both of 
these cases runs to the right. For there at once arises two questions of fundamental 
importance in this connection: (1) Does the deeper and more marked fossa occipi- 
talis sup. sinistra give expression for a greater development of the left posterior 
lobe ? (2) Can there be demonstrated any correlation between the craniological 
characters, the development of the fossa occipitalis sup. and the depth of the sinus 
transversus?* To the first question Elliot Smith originally thought he could give 
an affirmative answer. For he found on hardened brains that the area striata, 
i.e. ^'the area of cortex containing Qennari's stria," extended far more widely over 
the outer convex surface of the hemisphere on the left side, and he interpreted 
this finding as a sign that the left area strata was much more diffused, but later 
in 1907 he adds — “which subsequent investigation has shown to be erroneous” 
(p. 574), and he describes in this connection an attempt to measure the area striata 
directly on 10 brains. As the results did not confirm the original assumption, he 
advances the theory that it is the greater development of certain other cortical 
regions which causes the right area striata to fold itself largely on the medial 
surface of the hemisphere, and he mentions specially as a possibility Flechsig^s 
large parietal association centre, which is stated to be much larger on the right 
side. This, too, is of course merely a conjecture, but it seems to be worth mention- 
ing on the background of my own and Woo’s investigations respecting cranial 
asymmetry. That the left region of vision is not more developed than the right 
has subsequently been established by Cohn and Papez(6). Measurements of the 
fissura calcarina on 100 hardened brains — and according to Campbell (6) the length 
of the fissura calcarina gives expression for the extension of the region of vision — 
gave 8*8 cm. for the right side and 7*9 cm. for the left. In no less than 72 % the 
furrow was hmger on the right side, as against 10 % longer on the left. These 
measurements are unaccompanied by information as to the influence of error of 
measurement, but the authors verify their results by direct planimetric measure- 
ments, which led to practically the same result. I shall pass from this question 
without bringing into the discussion the difficult problem of “left-handedness." 
I refer to the above-mentioned paper by Cohn and Papez, and above all to Woo 
and Pearson’s (88) : “Dextrality and Sinistrality of Hand and Eye,” where the latter 
authors’ reserved attitude to the question finds expression in the accompanying 
motto : “ Keine ErkliLrung ist aber doch besser als eine oder mehrere irrthtlmliche.’’ 

Further Elliot Smith believed he could answer affirmatively the second question. 
He states that in a series of Egyptian skulls (which we may certainly be permitted 
to assume to have been large in number) about 80 % show marked asymmetry, 
with left-sided preponderance of the fossa occipitalis sup. (25, p. 576), and at the 
same place it is said that in the cases where there was distinct divergence the main 

* A third and not less important question is whether the inverse occipital asymmetry reaUy is 
associated with left>handedness. This question £Uiot Smith claims to be able to answer in the affirmative 
and he refers to a comparative investigation made by Wood- Jones in 1926 (85) respecting the length of the 
bones of the upper arm. 
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direction of the transverse sinus was to the left. No statistical proof of this relation- 
ship is furnished, however, and Tildesley, who took up the same question in 1920, 
arrived at the following negative result: "One has to conclude therefore that in 
this series— 127 Burmese skulls — there is no significant association between outer 
occipital asymmetry and the direction in which the main occipital sinus is turned 
when its course is changed from longitudinal to lateral” (27, p. 267). Meanwhile 
Tildesley points out that this external examination of the squama occipitalis is not 
decisive for the question discussed by Elliot Smith, which thus still remains 
unanswered, at any rate from a statistical standpoint. It is on this point that our 
material is calculated to cast some light. 

The characters here concerned are in their manifestation of a quantitative nature, 
although it is not inconceivable that the underlying biological causative factors 
are alternative (dextrality or sinistrality). Meanwhile it is practically impossible 
quantitatively to measure the size of the posterior fossa or of the sinus transversus. 
On the first examination of the material I have noted the depth (size) of the 
sinus transversus in three categories, r, I and ?, and on the second investigation 
noted the development of the posterior fossa in the same manner. The cases where 
the one groove was deeper, but the other of greater breadth, are assigned to the 
?-group. As regards the statistical treatment of the data, I have arranged my 
material into 9 cell-systems and to these applied “the root-mean-square con- 
tingency,” the formula for which is 


<f>® *= - 2 

'an 41 * 


where Upq stands for observed and Vpg for theoretically expected frequencies. This 
dimension is compared with 


</>* for zero-correlation = * 


± -67449 


/2c 


where c is the number of “cells,” and further, for the sake of completeness, Ca 
(coefficient of mean-square contingency) has been reckoned out from the formula 




V i + 


I have found it justifiable, on the basis of our earlier investigations of asymmetry, 
which revealed the same conformity to rule in all groups, to deal with the material 
collectively, without regard to sex or race. 

We will first examine the relation between length of hemisphere and size of 
fossa occipitalis sup. 

Greater Length of Hemisphere 



Right 

Equal 

Left 

Totals 

Right 

24 

14 

12 

60 

Equal 

19 

45 

45 

109 

Left 

9 

53 

104 

166 

Totals 

52 

112 

161 

325 
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From the vertical marginal totals it is seen that our material shows the same 
regularity as regards the fossa occipitalis sup. as Elliot Smith’s Egyptian series: 
51*1% are larger on the left side, against 15*4% on the right side. The group 
“equal” forms as much as 33*5%, which shows that our estimation has been very 
cautious. We see that the group “lefb-left” is by far the most numerous and the 
group “right-right” considerably more numerous than the dissimilar combinations. 

^***01846, as against 0*0246 ± 0*0088 for zero contingency and 0*3947 
(without “class index correction”). We can conclude from these figures that the 
greater development of the left posterior fossa is a very important, possibly the only, 
cause of the greater length of the left hemisphere in our material. 

Relation between fossa occipitalis sup. and sinus transversus: 


Greater Groove 



Bight 

Equal 

Left 

Totals 

Bight 

n 

15 

23 

50 

Equal 


24 

12 

109 

Left 


25 

6 

166 

Totals 

220 

64 

41 

325 


From the horizontal marginal totals it can be reckoned out that the transverse 
sinus in 67*7 % of the skulls goes preponderatingly to the right, as against 12*6 % 
to the left, which accords welK with earlier observations made by Le Double 
(68*5% right, 14*5% left), Elliot Smith and Tildesley (70*1% right, 19*7% left). 
Further we find that 158 skulls are in accordance with Elliot Smith’s law — right- 
left and left-right combinations, while only 18 skulls form an exception. 

^2 = 0*1973, which on comparison with' the earlier mentioned value for 
zero contingency furnishes a good proof of the validity of the law. ((7* = 0*4059.) 

The relation between the sinus transversus and length of the cranial cavity is 
seen from the following figures: 


Greater Length of Hemisphere 



Bight 

Equal 

Left 

Totals 

Bight 

12 

70 

140 

222 

Equal 

11 

33 

20 

64 

Left 

30 

10 

2 

42 

Totals 

53 

113 

162 

328 
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It is seen that the dissimilar combinations are by far the most numerous, the 
contingency is even higher than before: 

0*3810, while for zero contingency is about as before. 

C 2 * 0*5786. 

We thus find a high contingency between length of hemisphere and main 
direction of the sinus transversus, assuming that length of cranial cavity forms an 
adequate expression for length of hemisphere. We have thereby succeeded in 
furnishing statistical proof of the validity of Elliot Smith's law, and by a method 
which — as far as regards one of the characters — is independent of all subjective 
judgment. It is reasonable to suppose^ as does Elliot Smith, that there exists a 
comparatively simple causality relation between the characters, but we cannot 
a priori preclude the possibility that the correlation is due to other circumstances, 
for example, the influence of some common factor (position of the fetus in the 
uterus?). It may possibly be supposed that such (hypothetical) factors may be the 
cause of the correlation not being absolute. In our last table there are two cases 
where the left hemisphere has the greater length while at the same time the flow 
of blood goes mainly to the left, and in twelve cases the situation is the opposite. 
In these cases it may be that factors of otherwise secondary importance have played 
a decisive part. 

Especially it must be noted that our figures tell nothing as to the relation to 
left-handedness. In this respect a systematic investigation of length of hemisphere 
in connection with the size of the sinus transversus and the length of the humerus 
on each side would undoubtedly be illuminating. 

The results of our investigations as to the asymmetry of the skull — and, 
indirectly, of the brain — may be briefly summed up as follows: 

1. Left hemisphere longer than right. (+ deviation to left in 49*4 %, -h devia- 
tion to right in 16 2%.) 

2. Right hemisphere broader than left, (-f deviation to right in 67*1%, certain 
H- deviation to left only in 2*2 %.) 

3. Left fossa occipitalis sup. more developed than right. (Left fossa greater in 
61*1%, right fossa greater in 15*4%.) 

4. Right sinus transversus larger than left. (Right in 67*7%, left in 12*6%.) 

Hereunto may be added that, according to Elliot Smith as well as to Cohn and 
Papez, the right area striata is greater than the left, while at the same time we 
may remember that Hoadley and Pearson's investigations of length of hemisphere 
in their Egyptian material give results different from ours. The most important 
problems remaining in this field accordingly are the following: 

(1) Will direct measurements on fixed brains confirm our findings as to the 
greater breadth of the right hemisphere, and if so, what is the cause of the greater 
breadth (parietal lobe)? 



EL Wagkee 


121 


(2) How are we to explain the far greater development of the left fosea oompitalis 
sup. when at the same time the area striata is less developed on the left side? 

(3) How can we explain the cases in which the sinus transversus deviates from 
the rule ? 

VI. Dimensions of Cranial Cavity and Prediction of Capacity. 

It would lead too far to give a detailed account of the many attempts that have 
been made to set up formulae for calculation of capacity. In this respect we may 
refer to Martin's text-book, while a recently published paper by Neuert<i9) also 
gives a survey of the literature on the subject. It will be found that in the main 
two principles have been followed. The investigators have either (1), cus do Broca 
and Manouvrier. begun by taking the product of the chief cranial dimensions 
(Z, B, H), occasionally with measurement of arcs and circumferences, and then 
introduced a number of corrections in order to reduce the error to the smallest 
possible amount, or else (2) they have, like Lee and Pearson, computed the 
correlation between the directly observed capacity values cmd the product of the 
three main diameters (eventually taking arcs and circumferences) and on the basis 
of this correlation established the best possible regression formula. This latter 
mode of procedure (No. 2) might be designated the empirioal-atatistxcal method 
and there can hardly be any doubt but that it is scientifically better founded than 
the former, which is of a priori nature*. The English-speaking, German and 
Scandinavian anthropologists have by degrees united in adopting the English 
method, and the formulae most frequently employed are the following f : 

I. Lee’s interracial formula, computed from 8 different races. 

Male C - 0000 S70. L.B.0H + 32116 cm.» 

Female (7« 0000 Z7b,L,B,0H ^ 296*40 cm.» 

II. Pearson 8 interracial formula. 

Male C = 0*000 365. L.B . OH 4- 359*34 cm.® 

Female G « 0*000 375. L.B. OH ^ 296*40 cm.® 

In case the ear-height is not known, other formulae come into consideration, as 
to which we may refer to Martin’s text-book. The same applies to the intra-radal 
formulae^ which are of course to be preferred in many cases. The importance of having 
a good interracial formula is, however, obvious. Sometimes the race is unknown or 
doubtful, as, for example, in dealing with prehistoric crania, while in other cases 
an adequate intraracial formula is lacking. Meanwhile it is maintained in several 
quarters that these mathematically determined interracial formulae are not 

* Instead of linear measarements Nenert adopts measurement of plane surfaces (oephalograms), taken 
in different planes of the skull, sometimes several surfaces in the same plane. The advantage of this 
method is that peculiarities in the uhaige of the cranium And better expression. The method seems rather 
complicated and can hardly be said to be fully elaborated statistically, but judging from Neuert's paper 
the results are very good. For X>ray examination of living individuals it undoubtedly has a future 
before it. 

[f As to the attribution of these formulae, see my Note following this Memoir. Bn.] 
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particularly satisfactory, and the reason for this may be found in four factors: 
(1) Influence of bone thickness, (2) development of the sinus frontalis, (3) peculi- 
arities in the shape of the skull and (4) fortuitous circumstances in connection 
with the composition of the material forming the basis for the formulae, both with 
respect to selection of races and to the numerical proportion of the individual races. 
In the paper which has already several times been mentioned Hoadley and Pearson (o) 
therefore investigate the correlation between the product of the three chief internal 
diameters, Li, Bi and Hi (by Hi is unden?tood here the internal basion-height, 
perpendicular on the Frankfurt horizontal) and the capacity. The latter was 
determined “by packing the skull tightly with mustard seed and then weighing.” 
It was found that the correlation had increased from 0*82081 (between capacity 
and external measurements) to 0 89586 (between capacity and internal measure- 
ments). The possibility of obtaining a better formula was therefore obvious. With 
a view to rendering the method more serviceable in practice the authors tried to 
determine Li, Bi and Hi from L, B and H, but this attempt failed owing to the 
slight correlation between thickness of bone and external measurement. The authors 
therefore conclude by stating that “we must either be contented with the degree of 
accuracy provided by the external diametral product, or if we wish to improve on 
that accuracy we must practise the rather difficult technique of internal measure- 
ment” (p. 93). Owing to the great technical difficulties the authors express doubt 
as to the utility of the new mode of procedure. As my new instrument renders the 
technique simpler and more time-saving, while at the same time the accuracy 
of measurement becomes greater, fresh interest is created in the importance of 
internal measurements in determination of capacity. And there can be no doubt 
that the correlation between the internal measurements and the capacity has also 
in our material been considerably increased as compared with the external measure- 
ments, but I have nevertheless refrained from this mode of procedure, as our 
material comprises too few races — two of the races especially (Maori and Australians) 
being too badly represented — to form the basis for an empirical-statistical inter-racial 
formula. Such a formula would, in fact, not have a long life. According as more races 
were investigated it would be necessary to make small changes in its different factors. 
I have therefore selected another mode of procedure, which no doubt is theoreti- 
cally to be compared with the a priori method, but which I think rests upon a 
better foundation than all previous attempts in that direction, and which in practice 
will give better results than Lee's and Pearsons formulae. If we disregard the 
individual peculiarities of the cranial cavity and compare it to a regularly shaped 
space, then it is most natural to think of the shape of an egg, and the stereo metrically 


known body that will first come into consideration is the ellipsoid, the simple 
formula for which. 


C 


L.Br.H 

6 


314169, 


it should be possible to apply to the three main diameters of the cranial cavity 
As height of the ellipsoid we have chosen the internal ear-height (external ear- 
bregma height minus thickness of the os parietale) instead of the internal 
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baeion-height or internal opiathion-beight, which owing to the greatly varying 
clivus-angle would give less satisfactory results. And instead of the maximum 
breadth of the cavity alone I have for the breadth of the ellipsoid used the mean 
of greatest breadth of cavity + maximum frontal breadth. If we inspect a series of 
skulls from one or more race in the norma verticalis, we soon become aware of 
the greatly varied tapering oi the anterior pole of the skull. It is mainly this 
circumstance that forms the basis for Sergi’s classification of skull t 3 rpes into 
Pentagoids, Ellipsoids, Rhomboids, etc.; cf. contours on pp. 688 — 689 of Martin's 
text-book. By introducing the maximum frontal breadth in addition to the internal 
measurements we therefore hope to eliminate or diminish three of the previously 
mentioned sources of error. The ideal procedure would of course be to use the 
internal frontal breadth, but this dimension is very difficult to obtain and, besides, 
the thickness of the bone at this place is of veiy little importance. 

The formula we employ is therefore the following: 

<7= 0 0006236 . £, . ^ . Hi. 

where Li = — ^ “ greatest breadth of the cranial cavity, *» greatest frontal 

3*14159 

breadth. Hi = internal ear-bregma-height and and 0*0005236 *« 0 x iOOO ’ 

We shall first see how this formula suits our material without an additive term. 
The directly observed capacities in the following table have been found by means 
of the mustard -seed method and measuring-glass. Ranke's bronze skull has 
invariably been employed for checking the packing. 

TABLE IX. 



Norwegians 

Lapps 


Australians 


6 

? 

6 

9 


9 


9 


n = 81 

n = 76 

n = 68 

n»55 


n=rl4 

n:=:18 

71 = 10 

Directly observed C 

1466 

1293 

1473 

1301 

1418 

1327 

1278 

1113 

±14*3 

±11*4 

±13*8 

±11*3 

±20-2 

±23*3 

±21*9 

±17*3 

Computed C,.. 

1234 

1093 

1247 

1123 

1195 

1109 

1072 

931 

Observed-computed C 

222 

200 

226 

178 

223 

218 

206 

182 


The best possible intraracial formulae are accordingly obtained by adding to the 

( ‘B\ “4“ J5 \ 

. Hi as additive term the values given in the lowest 

column. The next step in the investigation should be to apply these formulae so 
as to ascertain the individual divergencies. Here we lack, however, intraracial 
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formulae for comparison, and therefore we proceed at once to use the interracial 
formula. It is seen that the additive terms show remarkably little variation. This 
is of course a necessary condition for a good interracial formula when the other 
factor should at the same time be kept constant. On weighting the additive terms 
with their respective n-values we get the following interracial formula, which we 
will at once compare with Lees and Pearson's*: 

For </■ skulls C = 0 0006236 U . • Hi + 222 cm.» 

For ? skulls G =- 00005236 Z, . (--|~*) • Hi + 192 cm* 

The following Table X gives us an opp<:»rtunity of comparing the mean figures 
for the three methods with the direct measurements. 


TABLE X. Mean figures. 



Mustard 

Seed 

Lee 

Pearson 

Ellipsoid 

formula 

Mustard 

Seed 

Lee 

Mustard 

Seed 

Pearson 

Mustard 

Seed 

Ellipsoid 

formula 

Norwegians ^ 

1450 

1443 

1463 

1456 

4-13 

- 7 

0 


9 

1293 

1279 

1279 

128.5 

4-14 

4-14 

4* 8 

Lapps 

cT 

1473 

1419 ! 

1444 

1469 

4*54 

4-29 

4- 4 


9 

1301 

1294 

1294 

1315 

+ 7 

4- 7 

- 14 

Maori 


1418 

1428 

1450 

1417 

- 10 

^32 

4- 1 


Q 

1327 

1300 

1.301 

1301 

4-27 

4-26 

4-26 

Australians i 

1278 

1333 

1357 

1291 

-65 

-79 

-16 


9 

1113 

1178 

1178 

1 

1123 

->65 

-65 

-10 


It is seen that in six groups the ellipsoid formula gives better results than 
Lee’s and Pearson’s methods and in one group (Lapps ? ) the difference is only 
7 cm.® in the opposite direction. In the group Maori $ the difference is insignificant. 
Lee’s and Pearson’s formulae suit very badly for our Australian skulls. Somewhat 
better results would here be obtained by following Wacker’s instruction to add 
60 cm.® for thick-walled skulls (Martin, p. 647 — really intended it t>nly to apply to 
Lee’s method). The results would, however, still be worse than with the ellipsoid 
method, and moreover Lee’s and Pearson’s formulae would in that way lose their 
inter-racial character. 

Table XI shows the mean individual divergencies, without regard to plus or 
minus sign. 

It is seen that the ellipsoid formula brings us on the average from 5 to 53 cm.® 
nearer to the directly observed values. 

* In the additive terms we entirely neglect the decimals, which are without practical importance. The 
exact figures would be 222'86 and 192*66 cm*. After the figures 222 and 192 had been employed one of 
the Australian skulls was rejected as being defective, which explains the slight divergency as regards 9 . 
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TABLE XI. 

Average Individual Divergency, without regard to sign. 



Ijee 

Pearson 

Ellipsoid 

formula 

Norwegians A 

54 

55 

47 

n $ 

45 

45 

33 


69 

56 

35 

« $ 

40 

40 

35 

Maori A 

28 

41 

23 

9 

42 

42 

37 

Australians A 

69 

85 

32 

.. 1 

65 

65 

22 


Here again it is the Australian material that gives the worst results by Lee’s 
and Pearson’s methods. A deduction of 60 cm.* renders the figures somewhat more 
favourable, but they still continue to be worse than by my new method, the average 
divergency being 45 or 68 cm * for males and 29 cm * for females as against 
respectively 82 and 22 cm.* by the ellipsoid formula. 

An investigation of the standard deviation of the divergencies with zero as 
mean has given the following results: 

TABLE XII. 

Standard Deviation of Divergencies from directly observed C. 


Race 

Mustard Seed 
Lee 

Mustard Seed 
Pearson 

Mustard Seed 
Ellipsoid 

Norwegians A 

70-1 ± 5-5 

72*8± 5*7 

58*0±4*6 

» 9 

55-6 ± 4-5 

65*6 ± 4*6 

41*0 ±3*3 

Lapps A 

82*6 ± 7-3 

69*1 ± 6*2 

44*3 ±3*9 

9 

53-2 ± 5-1 

53-2 ± 5*1 

41 *3 + 3*9 

Maori A 

33*8 ± 6*0 

46*2 ± 8-2 

31 -5 ±5-6 

„ 9 

62-2 ±11*8 

62*2± 11-8 

46*8 ±8*8 

Australians A 

75*1 ±14*7 

94*3 ±18-5 

37*6 ±7*4 

I 

72*3 ±16*2 

72-3±16*2 

28*6 ±6-4 


Now it might naturally he objected against the comparison here made that the 
ellipsoid formula has been applied to a material which had served as basis for the 
establishment of the same formula. As the additive term is the only factor which 
varies, the question to be considered is the infiuence of that factor on the comparison. 
There are two circumstances which reduce the weight of the objection. Firstly, the 
fact that our material includes two such widely divergent races as Lapps and 
Australians, wherefore it is probable that the formula will give correspondingly good 
results also for races which as regards shape, capacity and thickness of skull lie 
between these eztremea And, secondly, the objection will not apply at all to the 
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two groups, female Lapps and female Maori, where the mean figures for the 
capacities computed by the ellipsoid formula are not better than those obtained by 
Lee's and Pearson’s methods. It is seen from Table XI thcU even in these cases the 
average individual divergency is reduced^ although only hy 5 cm} Although the 
advantages of the new method can hardly be doubted, yet for control I have 
applied it to an entirely new material. For this purpose a series of Eskimo skulls, 
18 males and 23 females, likewise belonging to the Anatomical Institute, was 
selected. On account of the peculiar wedge-shaped tapering of the parietalia this 
material should provide an especially good test. From the figures given below it 
app^rs that here also the ellipsoid formula gives much better results than the 
other two methods: 


Eskimos. 


Mean values 

Directly observed C 
Lee 

Pearson ... 

Ellipsoid formula 

Average divergency 

Lee 

Pearson 

Ellipsoid formula 


Male Female 

1526 cm.’ 1635 cm.’ 

1480 „ 1318 „ 

1602 „ 1318 „ 

1622 „ 1336 „ 

76 cm.® 48 cm.* 

69 48 „ 

40 35 


The standard deviation of the divergencies from the directly observed capacities : 
For males: tr Mu.Urd seed: Lee 76-6 ± 12*6, 

^ Mustard Seed: Pearson 77 0 i 12 8, O’ Mustard Seed: Ellipsoid 51 3 i 8 55. 

For females: <r Mustard seed: Lee 53*5 ± 7*9, a Mustard Seed: Ellipsoid 46*8 ± 6*8. 
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[* 13u«e TaUies ate not in aooordanoe with the statements in the first column. ? replace bj 125*756, 3*43, 3*045, 6*345, 121*97 and 118*085 respectively. En.} 




TABLE XIII (continued). 


K. WaON3B» 


129 



Biottstrlka xtm 


1 163*72^. ^ [t This value and the results on eith^ side dependent on it are remarkable, and if correct show how Httle 
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[* ? Bead instead of these Tespeetively 3*39, 8*535 and 5*61. £d.] 
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NOTE ON SECTION VI OF DB K. WAGNER’S MEMOIR. 

By Kaxl Piabson. 

1. I find it difficult to follow the reasoning of a portion of Dr Wagner’s most valuable paper. 
But before I enter into any critioisin, it is needful to remind the reader of one or two points which 
all of us are apt to overlook. There are two types of formulae respectively termed by me many 
years ago, intrarctcial and inUrraeial, If we wish to predict a chai^ter like the capacity of the 
skull which is laborious to measure, or it may be impossible, more easily ascertained lengths, 

such as external diameters, then we require to know the most probable vidue and the standard 
error, which determines the range of variation about this most probable value. The statistical 
method is definite— provided we make certain assumptions — i.e. we measure the dependent 
character desired and the independent characters in as many individuals of the same race as are 
available and construct a regression surface, and theory then gives us the probable value and the 
standard variation about that value of the character desired in terms of the independent variables. 
On the assumption that cranial characters approximately follow what are termed normal distribu- 
tions, the resulting regression surface is planar, and the standard variation depends on the 
multiple correlation coefficient. The result thus obtained enables us to predict from the given 
set of independent variables the desired character in any number of the same race with a given 
likelihood of accuracy. Such a construction formula is termed an intrarcusud formula. 

As long as the hypothesis of normality is sensibly true, and therefore our formula linear, the 
mean of any desired character in a number n of crania of the same race will be obtained with 
the same accuracy by inserting in the formula the means of the independent variables. 

If we now take the means of the desired character in a number of races and the corresponding 
means of the independent characters, these means will follow normal distributions, if the 
individual characters do ; and we construct in the same manner a planar regression surface for 
ascertaining the mean desired character in any new race from the means of its independent 
variables. Such a regression surface is termed an inUrraeial formula. 

The process is perfectly straightforward if laborious from both the measurement and computing 
sides, once the hypothesis of normal distribution of cranial characters is accepted ; and, as theory 
demonstrates, no other planar formula with the eame independent variables can give as good, 
much lees better, results. The formulae, whether obtained by the theory of multiple regression or 
by the method of least squares, are identical 

In the above manner intvaraoial reconstruction formulae were worked out for five races, in 
1901, by Dr Lee and myself*. We Worked them out for various combinations of the Length, 
Breadth and Height of the skull, including the product of these diameters. These intraracial 
formulae gave by no means bad results and average formulae from our best material. Bavarian* 
Aino and Naqada, gave us general formulae for the two sexes (loc. oit, p. 243), namely, 

( (f ) *000337/i5-fir4' 400-01 \ 

( 9 ) C7- •0004MLBE+20Q.60 } 

for an individual skull of any race where O is the capacity in cubic centimetres and the 
diameters are measured in millimetres. 

L is the maximum (glabellar-occipital) length, B is the maximum breadth taken on the 
parieUda^ H the height measured from the auricular line perpendicular to the Frankfort Flam, 


^ PhUoeopMeal TrantaetioM JR. 8. Vol oxcvx A. pp. 225-— 264. 
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It must be remembered that the mean capacity of a number of crania can only be obtained 
approximately from these formulae by inserting the mean values of LBH ^ — for the mean value 
of the product {LBH) is not the product of the mean values of X, 2?, and H. 

Now let us turn to interracial formulae. Few investigators have measured enough crania of 
a large number of sufficiently diverse races to obtain a good interracial formula, whether it be a 
linear or product formula. The use of a large number of racial capacities measured by different 
observers may l>o most misleading ; not only may the technique be different, but variations due 
to personal equation are excessive. The older French measurements cannot be very safely pooled 
with the German, and the older English measurements with either*. In the course of the Boyal 
Society memoir of 1 901 Dr Lee and I r<\ached by the method of least squares interracial formulae 
{los. cit. p. 247) for male and female series. These are as follows : 

(^) 000365 Zxi5xF+ 369-34 \ 

(9) -000376 ZxZxZf + 296-40 / ^ 

where H is again the mean auricular height measiu-ed perpendicular to the Frankfort Plane. 

These results were obtained from the moan capacities, and the mean diameters of 9 or 10 
races; for this reason they cannot be applied to an individual skull of any special race. They are 
formulae for mean values : cf. what has been said on p. 133. 

I have referred to the difficulty of getting capacity measured in the same manner by different 
observers and techniques, and accordingly in a footnote Dr Lee and I gave the male formula with 
three doubtful series omitted, i.e. 

(<J) C«*000370XxXx.Sf +321-16 (iii). 

To my surprise Dr Wagner breaks up Dr Lee’s and my joint work, and provides two sets of 
formulae. He attributes to Dr Lee (ii) and to me (iii) for males and the second of (ii) for females. 
It has the appearance of our having j>rovided two sets of different and partly discordant 
formulae ! Why he should se].>arate out a result in a footnote to a joint memoir as peculiarly my 
fundamental formula for racial capacities I am puzzled to say 1 

On the same page (lac. cii. p. 247) are given the interracial formulae 

iS) C'- -0002662 xJx^' + 624-6 \ 

(9) 6^=-000166Zx.gx^' + 8120 j 

where £[' is the mean height of the skull measured from the basion to the point of the skull 
vertically above the basion when orientated to the Frankfort horizontal plane. 

Now the reader will note that neither H {^^OH ai later notation) nor XT' are measured to the 
bregma. 

It would have been theoretically more reasonable to take instead of L the length L\ the gerade 
Ldnge of the Frankfv.rter VerstAndigung^ for then L\ B and H (or XT') would have l)een mutually 
at i*ight angles and have i*epreseuted a true physical right six-face, and the product would have been 
its actual volume. This w'as not possible in our investigations, for the length- projection L* was 
not provided by most craniologists. Thus LBH^ or LBU\ still less ^ (Xjj + Zx) x x cannot 
be looked upon as the volumes of right six -faces, closely embracing the skull. Such products are 
merely arbitrary functions of characters of the skull, and there is no theoretical justification for 
4tr 1 

multiplying them by x ^ as a factor, on the basis of the volume of an ellipsoid. The skull may 

resemble an elli^)Soid, but if so its capacity must he measured by the product of three mutually 
rectangular axes, and not by quantities like LBH which are not mutually reotang^ar. 

Of course we may start from any function f of cranial external or internal variables, and its 
fitness will depend on how intense the correlation of C and / may be. That is the sole test of 


* On these points see the paper by M. A. Lewenz and K. Pearson, **0n the measurement of Internal 

Capacity from Cranial Ciroumferenoes,” Biometrika^ Yol. in. pp. 866 — 397, 1904. Cf. especially p. 877. 
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wlieiher one/ia or is not better than a eeoond. Dealing with intraraoial data for Bg^rptiana, 
Series E, Pearson and Hoadley found a oorrelation of € and Hi of *8969 ; they aoccmlingly 
judged that the product (Z< BiMi)^Pi was better than {LBH)miP which gaire a oorrelation of 
*8908 only. They did not go further than providing the intraraoial reconstruction formula for 


Andent Egyptians^ : 

( ) C- -(XmSTfiP* 4- 169*67 ± 34 /n/» (v). 

In the same place they gave the external diametral formula 

(i)C^ •0003496P + 226*62 ± (vi), 


indicating that the internal diametral formula was about 26 per cent, better than the external 
diametral formula. 

If the reconstruction formula be looked upon as C—AP+P, then the ** best” values for A 
and B are to be found from the correlational calculus. Neither A nor B ought to be **guessed.” 

In 1904 Lewenz and Pearson + reinvestigated the formulae for intraraoial and interracial 
reconstruction, as a protest against a ‘^guess” formula of Dr Beddoe}. He had used an /, into 
which he introduced the three oiroumferential arcs, and the cephalic index. This he was of course 
q^nite at liberty to do, although the intraraoial oorrelation of cephalic index and capacity is 
almost insignificant, being about — *16 for dolichocephalic and + *10 for brachyc^phalic races. 
But the gravamen of our charge against Dr Beddoe was that having chosen his /, instead of finding 
its oorrelation with <7, and testing accordingly whether his f was bettor or worse than other 
people’s, he ^'guessed” his A and B, He guessed P«0(!) and felt by trial and error his way to a 
suitable A §. Now we fear Dr Wagner has done something of the same nature. He has chosen 
his/, but ho has not shown that his /has a higher correlation with C7than other /’s. Dr Wagner 
4w 1 

then guesses A as ^ for the coefficient of his /, which has no theoretical justification, substitutes 
o o 

the mean values of his variates in his/, and determines a B for each of his four races so as to 
satisfy their mean capacity ; he then averages these P’s and provides what he treats as interracial 
formulae 

{^)C^ •0005236Z< X i (Pi + Pj) 4- x 222 cm.® \ 

( 9 ) <7*= X i (Pi + B^) xHi+ 192 cm.® J 

Now this seems almost as difficult to accept from the mathematical standpoint as Dr Beddoe’s 
procedure. We must not be misled by the fact that Dr Wagner has taken four rooetf. The 
justification or want of it is precisely the same as if I had taken 'four skulls of a single race and 
based an intraraoial formula on them by taking 

C7« *0(X)6236P< X i (Pi + Pa) X //, + P 

and choosing P to fit those four skulls ! It does not matter that our author is here dealing with 
meana and not individual cranial values. Until the correlation between his racial (mean) C and 
his racial (mean) product P^ is so high that four points will determine accurately his straight 
regression line his formula remains theoretically unjustified. He has made his A take the same 
value not only for both sexes, but for both types of formulae intraraoial and interracial, for it is 
clear that he looks upon 

•0006236P8 4- *222 

* Pearson and Hoadley: Biometrika^ Vol. xxx (1920), p. 92. 

t * * Measurement of Internal Capacity from Cranial Ciroumferenoes , * ’ BUmetrika^ Vol . m. pp. 866 — 897. 

^ «*De revaluation de la signification de la capaoitd oranienne,” L*AfUhropoiogi€t Tome xxv. 1908, 
pp. 867~-294. 

I It is needless here to enter on other errors of Dr Beddoe ; one only is of importance for our present 
purpose: he, like Dr Wagner, did not take care to use the same oharaoters as were chosen by the authors 
of the formula. 



186 


Note on Section VI of Dr K. Wagner' a Memoir 

as an iutraracial formula for the Norwegian skull; nor does he^ake clear how he changes from 
the Li, Bi and Hi for an individual skull to a product Li Hi of racial means, for the mean 
“ elliiisoid ’’ of many skulls would not be the “ ellipsoid ” of mean axes. Even Dr W agner's theoretical 
justification for using the ellipsoidal coefl&oient, * * * § 0005236, falls to the ground when he uses 
i {Bf+Bi) for his intermediate axis, for if we give an ovoid form to the skull by using a maximum 
breadth and a lesser breadth nearer to one end, then the coefficient for the volume will not talm 
the ellipsoidal value. All experience shows that the coefficient of the product term does vary from 
sex to sex and from race to race. Indeed, as I shall show, a better prediction formulae than 
Dr Wagner gets for his fouir races can be at once obtained by considering both coefficients in 
CsBi APi + B as unknown and determined by least square or correlational methods. 

But there is a point which must be regarded in all these product prediction formulae. Wo 
have seen that roughly the characters measured on the skull follow normal distributions. 
Accordingly the regression formula for C on any linear function of L, B, H will be planar, but 
it by no means follows that the regression formula for C on a i>roduct will be linear. No 
theoretical demonstration of this has so far been given*, and at any rate in the o€ise of the product 
of the three circumferential arcs Lewenz and Pearson have shown by graphing the product to the 
capacity that for a wide range of races it is very unlikely that linearity exists t. 

2. Lot us look a little into the arithmetic of the formulae cited. 

The actual result of Dr Wagner's discussion is that he applies for males the intraracial formula 
for Norwegians os an interracial formula, namely : 

C7= 0005236Z, x i (Bf-h Bi)x Hi + 222 (viiw«), 

where the bars above the variables denote racial means, and 

J (Z/je 4-Ljr,)a=mean of internal lengths == the same quantity as used by Hoadley and 
Pearson in 1 929 ; 

= maximum internal breadth, agreeing with Hoadley and Pearson’s measurements; 

maximum external frontal breadth. This was a character not originally selected for 
measurement in the Biometric Laboratory, for its terminals frequently fall on very complicated 
sutures, so that they are very doubtful and subject to large personal equation. Dr Wagner does 
not state how he dealt with much indented sutures. Hi with Dr Wagner is the distance from 
bregma to the aimcular axis less the thickness of the parietal bone. This latter should be in the 
direction of the external meaem'emerit and at the bregma, but our author takes it on the side wall 
of the i>arietiil Ixme, thvis assuming that there is such ajgeneral character as the ** thickness of the 
parietal bone.” The Hoadley -Pearson Hi is the Internal basion height, perpendicular to the 
Frankfort Plane. We will call this Hi for the time being and we have the following values for 
two races, the Norwegian and Egyptians of the 20th to 30th Dynasties ; 


liace 

L, 



Pi 

Si 

Bi' 

^i 

Norwegians 

Egyptians 

172*28 

170*44 

12e{il 
124 -oat 

107*96 
107 •97§ 

236,3003*617 

228,2450*516 

134-06 

132*14 

127.07 

126*86 

293,4671*607 

200,1962*176 


* 1 am not overlooking the fact that if variations from the mean be substituted for L, B and H, then 
LBH can be expanded os a linear function to tk first approximation; but in this case we might as well 
start with a linear function and accordingly discard any of the advantages supposed to arise from the use 
of the product function. 

t See Biometrika, Vol. in. 1904, p. 887, 

t 116-92. 

§ » Bregma auricular height - parietal thickness ss 1 18*43 - 6*46 ss 107 *97. 
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Now let ^ apply to iheee produots tha iutmaoial male formula (tH) of Wagner <intnanMdal §o» 
Jforwegiaoa} aiMl the iutraracial formula of Pearaou axid Hoadley for the Egypyaps. We hav*: 



Norwegians 

Egyptians 

Wagner 

Pearson-Hoadley 

1464 (-S) 
1463 (-3) 

1417 (-23) 
1438 (- 2) 

Observed 

1456 

1440 


These results suggest three points, namely that : 

(i) There would not appear to be any advantage obtainable by introducing B/, a measure 
undoubtedly difficult to determine accurately and destroying the hypothesis that we are taking 
internal measurements. 

(ii) There is no special advantage in using the ellipsoidal coefficient *0005236. 

(iii) We are as likely to get as good a result from the Pearson-Hoadley form of product 
as from Dr Wagner’s P<. 

We will accordingly discard the unsatisfactory 5/, and the still more unsatisfactory with 
its measurement to the bregma and its inadequate bregmatic thickness, and see whether the 
Pearson-Hoadloy P( does or does not give as good results for an interracial formula as Wagner’s 
formula. He had the advantage of measuring the brain-box internally for five races in both sexes, 
using in all 191 male and 182 female skulls. All his series were short, some of them terribly 
short, in six of the ten cases under 25 crania! We took the largest series in the Biometric 
Laboratory, 729 male adult crania, because our purpose was to test how far internal measurements 
gave a better result than external measurements, freeing ourselves as far as possible from the 
errors of random sampling. And we found they did — an improvement resulted of about 26*/^. 
Now Dr Wagner’s great merit is to have increased our supply of data by measuring internally 
five races to which we can add a sixth, «our Egyptians series. 1 have not the boldness to suggest 
an interracial formula based on four, or even six races, but I do wish to show that a formula 
better fitting those six races can be found by the usual processes of statistical theory, than by 
^‘guessing” A and then averaging B. Before doing so t should like to point out that for racial 
comparisons, as far as cranial capacity is concerned, but little stress can be laid on racial estimates 
of capacity which differ from the observed by 26 to 60 cm.^ Thus consider four of Dr Wagner’s 
shorter aeries and compare them with previously determined capacities for longer series : 


Dr Wagner’s Series 

Baoe 

Previous Values 

Oapaoity 

Size of 
Sample 

Oapaoity 

Size of 
Sample 

■n 

16 

Maori 

1476 

43 


13 

AustireJians 

1295 

134 


18 

Eskimos ... 

1559 

34 


63 

Lapps 

1522 

66 


(B^erences are given on p. 143.) 


It is obvious from these 'results that even if Dr Wagner’s interracial formula gave exactly in 
these four races his observed capacities, those capacities would not agree within 58, 18, 33, and 
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49cm.s with the vahies obtained from larger samples. This may be due to the siae of the 
samples, to differences in technique, or to the influence of local races ; but, clearly, any interracial 
prediction formulae coming within 26 cm.** to 60 cm.* of each other may easily be equally valid, if 
we are determining a “ racial ” character, not seeking a short cut to find the mean capacity of 
a series of crania. 

1 propose now to consider in the first place three prediction formulae for cranial capacities on 
the basis of the six races for which internal cranial measurements have been taken, namely: 

(а) Dr Wagner’s formula. 

(б) The “best” formula that could l>e obtained from Dr Wagner’s data, by the use of correla- 
tional theory. 

(c) The Pearson-Hoadley form of product, using all the available data. 

We need the following values : 


Race 


— 

fii 

Pi 


K/ 

Pi 

Norwegians 

172*28 

126*51 

107*96 

235,3003*617 

134*05 

127*07 

293,4671-607 

Lapps 

168*05 

130*33 

108*71 

238,0961*691 

140*05 

124*88 

293,9101*064 

Maori 

171*21 

121*78 

109*19 

237,6606*465 

131-3] 

130*44 

293,2497*960 

Australians 

163*77 

117*31 

106*46 

204,5294*477 

1*23*38 

128*77 

260,1919*229 

Eskimos ... 

176*53 

122*36 

114*92 

248,2296*255 

130*78 

133*61 

308,4599*744 

Egyptians E 

170*44 

124*03^ 

107*97 

228,2450*516 

132*14 

128*85 1 

290,1952*175 


From these we obtain for (6) by the method of correlation, 

*00056241 178 Pi-H 1 36*345 

and for (c) by the same method, 

*00051496732 Pi - 61*083 
while (a) is as before, __ 

*0005236 Pi, -h 222 cm.* . . . 

From these formulae results the following table : 


.(c), 

(«). 


Race 

Observed 

Capacity 

Wagner’s 

(«) 

“Best” P. 
Formula {b) 

Pearson’s P^' 
Formula (c) 

Norwegians ... 

1456 

1454 ( - 2) 

1460 (+ 4) 

1450 (- 6) 

Lapps 

1473 

1469 (- 4) 

1476 (+ 2) 

1452 (-21) 

Maori 

1418 

1414 ( - 4) 

1417 (- 1) 

1449 ( + 31) 

Australians ... 

1278 

1293 ( + 15) 

1287 (+ 9) 

1279 (+ 1) 

Eskimos 

1526 

1622 ( - 4) 

1532 (+ 6) 

1527 (+ 1) 

Egyptians E 

1440 

1417 (-23) 

1420 (-20) 

1433 (- 7) 

Mean Root Square Residual 

11*69 

9*47 

16*76 


Equation (6) shows that iising Wagner’s value for the ellipsoidal coefficient does not give 
the b^t values, nor does Wagner’s value of the constant term. Wo improve the root mean square 
residual by about 9*4 per cent, by using a correct statistical procedure. We see further as a result 
of the above table that the Pearson-Hoadley gives a worse result than Dr Wagner’s, — at least 
when we deal with the above six races. This is a misfortune, for its and Si are so much 


Z/ for these Egyptians s 115 *92 mm. 
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eMier of determinatioD than using both S/ and and the oomplioated and theorotioat^ v«ry 
doubtftil ffi. But it is clear that using the internal vertical basion height so easy to deter* 
mine, does not provide a formula so good, judged by the square root mean square residual, as 
Dr Wagner’s, There are points to be noted however in the formula (c). It gives the highest and 
lowest raoe oapaoities excellently, far better than (a) or (6) ; it fails peculiarly with the Maori 
capacity, but here it must be noted that Dr Wagner’s meastne of capacity is much below that of 
Scott, indeed by almost 50 cm.*. I defer a closer discussion of this point till we turn to external 
diametral product formulae. Meanwhile I will try to reach a formula betjter than Dr Wagner’s in 
which all the quantities to be measured can be obtained in a fairly simple manner, and which 
avoids his JB^and Si. 

We have seen so far that the vertical basion^eight, while not bad for a component of an inter- 
racial formula, is not as good as Dr Wagner’s Si, Let us see what the vertical opisthion height 
Si' will lead us to. 

The table of needful measurements is the following : 


Baoe 



B 

Pi" 

Predicted 

Talue 

Deviation 
from Observed 

Norwegians ... 

172*28 

134*06 

129*28 

298,5609*964 

1476*6 

+ 20 

Lapps 

168*06 

140*06 

127*60 

300,3117*369 

1486 

+ 12 

Maori 

171*21 

131*31 

124*29 

279,4236*212 

1383 

-36 

Australians ... 

163*77 

123*38 

129*08 

260,8183*071 

1291 

+ 13 

Eskimos 

176-63 

130*79 

131*39 

303,3347*507 

1600 

-26 

Egyptians £... 

170*44 

132*14 

130*66 

294,2491*670 

1456 

+ 16 


Obtaining the “best” formula by the method of least squares, we have: 

00049042126 4 12*310 (viii). 

Substituting the products P/' in the formula, we obtain the predictions given in the sixth 
column of the above table, which provide the deviations from the observed values given in the 
seventh column. These have a root mean square residual of 21*76 cm.*, not a very serious 
resulting error when we think of the amounts by which one sample of a raoe differs from a second, 
but worse than the formula which uses the vertical basion height, and very considerably worse 
than the “best” formula using Wagner’s P^. At first sight it would appear as if nothing had been 
obtained by investigating the results of using the vertical basion and vertical opisthion internal 
heights. But a careful examination of the two sets of deviations shows that if we use formula (o) 
in round numbers seven times and formula (viii) five times, i.e. weight them with and ^ re- 
spectively, the large deviations will all be reduced to insignificant proportions. We reach by this 
procedure the formula : 

(-00030039177 *00020434219.^/') 30*603 (ix). 

We have then the following results : 


Bace 

Predicted Value 

Deviation from Observed 

Norwegians ... 
Lapps 

Maori 

Australians ... 
Eskimqis 
Egyptians E... 

1461 

1466 

1421 

1284 

1616 

1442 

+ 6^ 

— 7 

, g Root Mean 

T a ’ Square Residual 

-e-10 
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For practical purposes it is adequate to use 




( •3004 + -2043 JSr/O 

iooo 


-30*60 




whore and HI' are the mean internal vertical heights from basion and opisthion respectively, 
Li is the mean of half the sum of the two maximum internal lengths, and Bi the maximum 
internal breadth. 


Formula (ix'*^) will predict for these six races the cranial capacity with a mean residual almost 
half that of Wagner’s, and about two-thirds of that of the “best” formula available when using 
Wagner’s expression for the product. Are we then to say that we have obtained a satisfactory 
intdracM formula ? Personally, I should say most decidedly not. You cannot form a satisfactory 
interracial formula on the basis of six races, any more than an intraracial formula on six skulls^, 
even if they are wisely distributed. Tliere is only one brachycophalic race — the Lapps — included, 
and no proper representation of Asia, Europe, America or Africa. Another group of six quite 
different races would probably give an entirely dift’orent equation. This is a problem which 
I would urge Dr Wagner to bvckle; he would at le»ist increase our data for internal cranial 
comparisons. 


Pearson and Hoadley demonstrated for the first time that the internal diametral product did 
give a higher cf)rrelation with cranial capacity than the external, and they measured the amount 
of the difference on their scries of Egyptians. 73r Wagner has not worked out this difference for 
any of his five series, but he has contributed to our material for internal measurement. This is 
a valuable step in the right dirciction, but till the data are increased six- to tenfold on consider- 
ably longer series than some he has used, we ought not to set about finding, still less speak of, an 
interracial formula. Indeed, before even this is attempted, there ought to be some standardisation 
of instrument used ff)r internal measurement, and further a standardisation of technique in 
meosureinont, especially that of the capacity. I am inclined to think, for example, that 
Dr Wagner is getting on his series lower capacities than other craniologists would do in using 
apparently the same method. What is more, I do not believe many craniologists woidd accept 
his process of deducing the internal “ear-bregma” height Hi by subtracting from the external 
“etir-bregma” height, a quantity he terms the “thickness of the parietal bone,” which he does not 
measure at the bregma. Even if he did measure it at the bregma, it does not follow that the 
external “ear-bregma” line cuts the cranial vault at right angles. Ho does not make use of the 
quantity ho terms the glal^ellar thickness of the frontal bone, but I am inclined to think that his 
process does not measure this thickness. I hold, however, that standardisation of measurements 
is essential before more work is done, and that most probably the Z^, Bi and JY/, Hi (all 
measured by Dr Wagner, but the latter two not used) will l>e easier to accurately determine, and 
give better results than using Bf and Hi as finally adopted by him. 


3. Lastly, I should like to say a few words on Dr Lee’s and my external diametral product 
prediction formulae. They give as closely as it is possible to give a prediction, formula with the 
LB H prod\ict for the races on which they are based. Thus (ii) gfive root mean squai*e residuals 
36*16 cm.® and 35*92 era.®, and (iii) 22*17 cm.® and 35*92 cm.*® respectively. 

The same formulae applied to Dr Wagner’s five races give : 

(ii) 41*02 cm.® and cm.®, as against 36*16 cm.® and 35*92 cm.® ; 

(iii) 41*70 cm.® and 33*09 cm.®, as against 22*17 cm.® and 35*92 cm.®. 


* The six skulls have an advantage over the six races. With caution their measurements may be 
accurately made, and the prediction formula follows by the usual statistical procedure. Hut with racial 
means a new factor enters into the problem, namely the size of the samples of each race, and tbi* will 
affect the A and B constants of the prediction formulae, even if the individual crania of each race be 
measured with extreme accuracy. 
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Wl&fla the femele voot mean equare re^diiale have actually been decreased themale iroot meae 
squsce reaidualB) aa we should anticipate, have been considerably increased, whw we apply the 
formulae to the data for races on which they were not computed. But suppoaiDg our formulae 
were correctly applied, would a prediction with a root mean square residual d 41 cm.® be 
▼aluelees ? Is it worth our while really striving at a root mean square residual of 9 or even 6 om.*l 
I doubt it, if two observers, owing to difference of technique, to varying size of sample, or to the 
effect of local races, may differ by 20 to 60 cm.^ in their direct measures of capacity ! 

Thus far I have assumed that Dr Wagner has applied our formulae correctly ; but he has not 
done so. He has replaced our of the Frankfurter VereUtndigwng by what he terms the 

" ear-bregma^’ distance. He does not state how he measures this or how he found his poria. But 
assuming these to be determined by the skull on the crauiophor, our Off is not his “ear-bi-egma” 
height, as he supposes. It may in some races be as much as 2 — 3 mm. greater (e.g. in the Maori 
or the Tibetan skulls), or there may be practically equality as in the case of the type English 
skull. It all depends on the slope of the cranial roof from aper to bregpna. 1 — 3 millimetres may not 
appear much regarded in itself, but it modifes most sensibly P the product. For example, taking 
the Lapps, if the Off exceed by 2*3 mm. the ear-bregma distance in males, then formula (ii) would 
predict the value 1466 instead of 1419, and bo in error ^ 7 instead of — 54, while if the female Off 
exceeded by 0*8 inm. the ear-bregma distance, the predicted capacity would have been 1301 
instead of 1294, showing no error instead of —7. I lay no stress on these illustrations; I only 
intend to indicate that the actual Off cannot be measured by the ^‘ear-bregma” height ; the 
difference depends on how the apex-bregma stretch of the crown slopes and how near together the 
apex and bregma are situated. We cannot therefore apply legitimately formulae (ii) and (iii), 
using the “ear-bregma” distance. We can only test what an external diametral formula will 
achieve by working one out for Wagner’s data. I have worked out the best male prediction 
formulae for the above six races, when we take as our products of external measurements 

(17370 and Pi^^^ZBff\ 

where Off' is not our vertical height 037, but the “ear- bregma” distance, and H' is not our 
vertical basion height ff^ but the basion bregma distance. All these external characters are 
provided by Wagner for his series. 

We form the following table : 


Baoe 

L 

5 

Off' 

P/" 

ff' 

Pi" 

Norwegians ... 

187*70 

142*34 


tm 

302,6526*466 

132*81 

364,6313*723 

Lapps 

179*17 

146*62 





Maori 

186*81 

137*88 



136*63 

361,9228*479 

Australians ... 

183*30 

130*77 

113*85 


136*92 

32e,8021-666 

Eskimos 

190*44 

137*66 

119*36 

313,2104*6 

139*39 

366,1689*671 

Egyptians E... 

186*33 

139*16 

112*43 

290,2757*630 

133*61 

344,7008*549 


Now before going further we note : 

(i) That the Lapp P/” and P^^ are less than those for the Norwegians. Accordingly, the 
predicted capacity will inevitably be lees for the Lapps than for the Norwegians; the observed 
capacity for the former is greater than for the latter. 

(ii) The Maori Pf* and Pp are greater than those for the Lapps. Accordingly, the predicted 
capacities for the Maori will be greater than for the Lapps ; the observed capacity for the former 
is considerably less than for the latter. 

(iii) The Egyptian E and P/^ are less than those for the Maori, but the observed 
caparil^ of the latter is less than for the Bgyptiims. 
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These results indicate that the prediction formulae baaed on Pi'* and Pi'^ will not be veiy 
satisfactory in reproducing the capacities said to be observed. I obtained the following prediction 
formulae using : 

0H\ the “ear-bregma” length, 

-00057622174 Pr- 272*514 (x) ; 

H'y the basion-bregma length, 

Ci ^ -000060513039 P< - 320*294 (xi). 

We have the following results for males: 


Bace 

Formula (x) 

Deviation 

Formula (xi) 

Deviation 

Norwegians ... 

1471 

+ 15 

1472 

+ 16 

Lapps 

1437 

-36 

1391 

-82 

Maori 

1450 

+ 32 

1457 

+ 39 

Australians ... 

1300 

+ 22 

1325 

+ 47 

Eskimos 

1532 

+ 6 

1524 

- 2 

Egyptians E... 

1400 

-40 

1421 

-19 

Root Mean Sciuare Residual 

27-88 


42-96 


From this table we csan draw two conclusion.s : 

(i) It confirms Dr WagneFs statement that it is better to base a prediction formula on the 
auricular height rather than on the basion height. 

(ii) That the large values obtained by applying the Pearson-Lee formulae (apart from the 
error of using OH' for OH') arises in large part from applying them to a set of entirely different 
races from those on which they are based. If we take the best material for external measurement 
available in the data, namely, X, and (TH'^ we have a root mean square residual of 27*88, 
a value as good as those obtained by applying our (ii) and (iii) to the data on which they were 
based, and far bettor than applying our (ii) and (iii) to WagneFs data. In other words, it is not 
legitimate to compare the relative efficacy of two formulae by applying one to the material on 
which it is based and the other to material totally different from that on which it is itself beused. 

I am far from asserting that wo cannot get a better external formula than one with a root 
mean square residual of 28, and that we ought to use my internal product formula with a residual 
of 6, or even Dr WagnePs with a rcisidual of about 12. There are many points to be considered 
about the data on which either type of formula is baaed. We cannot make comparisons on 
internal measurements, for they only exist at present for six series, but we can do so on external 
measurements. I have collected some to place beside Dr Wagner’s data. 

The table (p. 143) appears very instructive. We see that different recorders give distinctly 
different values for approximately the same racial group. Thus, take the Eskimo group ; Ftlrst 
and Hansen are dealing with a series which has mean diameters undoubtedly smaller than 
Wagner’s series. The relative Pia would suggest a skull with 124 cm.^ less capacity than that 
of Wagner’s series, yet their mean cai>acity is 2 cm.® greater; it might be anticipated to be 
much loss. Again, consider Hrdli^ka’s series of Greenland Eskimo ; his series, almost double 
that of Wagner’s in number, suggests crania considerably smaller than those of Wagner, and the 
prediction formula suggests an average capeicity 20 cm.® less than Wagner’s, but Hrdli5ka 
found a capacity 32 cm.® ffreater. 

Next turn to the Maori. Scott’s series, more than double that of Wagner, suggests a smaller 
capacity by 20 cm.® than Wagner’s, yet his observed capacity is 58 cm.® greater than Wagner’s 1 
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Baoe 

Beeorder 

C 

Z 

s 



Pzedleted 

Cspadiy 

Lapps 


Wagner* (63) 

1473 

179-17 

146-62 

129-06 

338,7819*103 

MM 


... ... 

Schreiner t (151) 

1465 

178-3 

147-0 

128-1 






Hfkllstenl (^) 

1622 


147-0 

129-6 

343,9896*560 

u 

Australians (?) 

Wagner* (13) 

1278 

183-30 

130*77 

136*9S 

325,8021*666 

1886 

a 

(Queensland) 

MorantS (19) 

1288 

186-1 

131-1 

138*3 

323,4739-113 

1314 


(General) ... 

Morantf (146) 

1295 

187-8 

132-2 

138-1 

342,8630*796 

1412 

Maori 


Wagner* (16) 

1418 

186-81 

137-88 

136*68 

361,0328*479 

1467 




Soottll (48) 

1476 

186-6 

140-1 

137*6 

357,6024*480 

1486 

Eskimos 

Wagner* (18) 

1626 

190*44 

137-66 

139*39 

366,1689-671 

1524 

7) 

(Greenland) 

Hrdli6kair (34) 

1659 

190*6 

136-9 

139-6 

361,1608-626 

1604 

»» 



Fiirst and Hansen IT (176) 

1528 

188*4 

134-4 

138-2 

340,6486-272 

1400 


OoDsidering the Australiant), Moraiit’s general series suggests by its diameters a series of crania 
larger by 83 cm.* than Wagner’s, and this is based not on 13 but 146 skulls, but the observed 
capacities only differ by 17 cm.*. The Queensland and Wagner’s series of about the same sise 
suggest that the former falls short of the latter by 11 cm.*, but the observed capacities differ in 
the inverse way by 10 cm.*. This again suggests, it does not of course prove, that Wagner’s value 
is too small. 

Lastly, turning to the Lapp data, we note that Schreiner’s and Wagner’s, although differing 
much in predicted and observed values, give very close differences of observed and of predicted 
values, 8 cm.* and 16 cm.*. This is as it should be, if Wagner is a member of Schreiner’s school. 
But if we turn to Hiillsten’s measurements, the latter’s crania are larger for all diameters, and give 
a predicted value exceeding Wagner’s by 26 cm.*, yet the actually observed values are such that 
Hailsten’s exceed Wagner’s by 49 cm.*. 

Assuming, as I think we may, that the measured diameters will not difler largely by 
differences of personal equation in measurements, the above results do appear to indicate that 
Dr Wagner’s technique in determining capacity gives ^ults falling short of that of other 
observers ; possibly, though I cannot prove it, they packed tighter. But what I do want to 
emphasise is this, that until we have standardisation in measurement of capacities, until we have 
such large series that errors of random sampling are negligible, and until we have such careful 
specification of the source of cranial series that we can define local races, various observers will 
give for the same **raoe” cranial capacities differing by any amount up to 60 cm.*. In view of this, 
refined prediction formulae which provide a 6 cm.* root mean square residual are scarcely to be 
preferred to rougher formulae with 20 to 30 cm.* root mean square residuals. In other words, 
till the three conditions mentioned above are satisfied, 1 am still of the opinion that while inteiml 
measurements of the skull may lead us to interesting results as to the size characters of the 
brain, their value as leading to cranial capacity pi^iotion is hardly worth the increased labour 

* Memoir in this issue. 

t Zur Oiteologie der LappeUt Bd. n. 8. 68. Instituttetfor SammenlignendeKtUtur/oritkningf Oslo, 1981. 

% HUlsten: Matdriaux pour servir k la oonnalssanoe dee orines des peuples Finnois. Bidrag till 
Kannedom af Finlands Natur ooh Folk Finaka V$ten$kap§-8ocieteten, 

I Biometrika, Vol. xxx. pp. 421, 429. 

|] Biometrika, Vol. xi. p. 98. 

IT Annala of Eugmict, Vol. x. pp. 261, 268. 













144 


Note on Section VI of Dr K. Wagner's Memoir 

of their determination. Overlooking for a moment the question of differences of technique, we 
may note that the standard error for capacity of a series of 13 crania is about 31 cm.’; of a series 
of 16, 28 cm.*; of a series of 18, 27 cm.*; of a series of 63, 14 cm.*; and of a series of 81, 13 cm.*; 
and further that even deviations 50 greater than these in excess or defect will occur in 13 to 
14 of samples of these sizes. What, I ask, is the value of using a formula with a 12 or even 
6 cm.* residual error on the oh^ervod small samples to get a predicted racial mean capacity when 
our sampling deviations alone are of the above order? 

One further point I would like to emphasise strongly. Pearson and Hoadley found in the 
case of 729 male Egyptian crania the internal length of the right hemisphere greater than that 
of the left hemisphere. This was so opixjsed to the statements of some earlier investigators, 
notably Broca, who had asserted that the localisation of certain important centres found in the 
left hemisphere led to or must lead to a greater size of the left hemisphere, that Dr Woo 
investigated this point by direct measurements on the frontal and parietal bones of a number of 
races. His conclusions for 887 Egyptian male crania showed larger external measurements on the 
right frontal and parietal bones. This evidence Dr Wagner (see his p. 116) dismisses somewhat 
light-heartedly. We may add that Dr Woo has found in the following series (i) English (120), 
(ii) Australian (100), (iii) Southern Chinese (80), (iv) Punjabis (80), (v) Kanakas (60), (vi) Congo 
Negroes (50), and (vii) Nigerian Negroes (45), for the three measurements (a) ophryon to 
stepanion, (6) minimum arc bregma to asterion, (c) minimtim arc sphenion to lambda with various 
degrees of difference preponderance in the size of the right frontal and parietal bones. It is 
needless to remark that these are the disti’icts in which the active centres are stated to be localised 
on the left side and so to give it its predominance. This predominance of the left side Dr Wagner 
tells us “is confirmed by certain other investigations which Hoadley and Pearson do not mention 
in their survey of the literature.” He then refers to papers by Hrdlidka (1907) and Connolly (1932). 
The former might have been referred to had we known of it, but hardly the latter, as our paper was 
published thi’ee years before Connolly’s ! In neither of those pa^^ers is the procedure exactly com- 
parable with the measurements taken by Hoadley and Pearson, Dr Wagner, or Dr Woo, and for 
numbers Connolly’s series are not comparable with those of Hoadley and Pearson. The contra- 
dictions are so great that one is almost compelled either to believe that there is really no 
predominance of either hemisphere, or to accept the impossible, namely that some investigators, 
when the skulls are in norma hasalis with the frontal, or it may be the occipital, on their right, 
have wrongly designated their metisurements, mistaking right for left hemisphere, or vice versa ! 

Finally with regard to the measurements made by Dr Wagner on the cranial contours 
published in this Journal and discussed by him on pp. 1 13 — 1 16 above, it is only fair to Dr Woo 
to remark that a similar investigation on the same contours was made two years ago by Dr Woo 
himself and will be found already published by the Academia Smica*, Personally 1 am doubtful 
as to this use of such contours, and certain that contours diagraphically drawn should not be 
included, for they do not represent points lying in one and the same plane, and accordingly the 
distances measured are not necessarily homologous lengths on left and right hand sides of a 
median line. 

* ** Evidence of the Asymmetry of the Human Skull. Derived from Contour Measurements,” Ts’ai 
Yilan P’el Anniversary Volume (Suppl. Vol. i. Bulletin of the Institute of History and Philology, 
Academia Smica, Peiping, 1988). Dr Woo states that ** where the differences are real and not chance 
ones, it is always the right side which is the greater,” and again that **the main conclusions of the 
present enquiry are in perfect accord with those of my earlier study, both having shown that the normal 
human skull is not symmetrical since the right side is slightly larger than the left in several of its 

parts.” 



SOME PROBLEMS IN THE ANALYSIS OF REGRESSION 
AMONG k SAMPLES OF TWO VARIABLES 

By B. L. WELCH, B.A 
I. ZNi:i(0i>troTioK. 

In a series of papers J. Neyman axid E. S. Pearson have developed a method 
based on a special application of the principle of likelihood, the object of which is 
to determine the most suitable test criteria to employJn testing certain statistical 
hypotheses. 

E. S. Pearson and S. S. Wilks ^ive extended the method to two variables. 
k samples, iSe* of two variables x and y being given, each sample was assumed 
to be drawn from a normal bivariate population wt specified by parameters 

bt, tTast, tTyt, and pt; i.e. the joint distribution of x and y for the population tti 
was assumed to be of the form 

L ^ (y-W 

— * 1 f 

Certain hypotheses about the population parameters were then considered*. 

In problems of regression rather than correlation this initial assumption may 
not however be appropriate, and it is proposed in the present note to consider the 
application of these methods of approach to the case in which we are primarily 
concerned with the analysis of variation of y for a given value of x. In the type of 
problem under review the observations fall naturally into a number, say k, of groups, 
and within the ^th group, the deviations of y*' from the regression straight line 
y are supposed normally distributed with standard deviation Thus 

for the ^th group the probability distribution of y for a given x is assumed to be of 
the form 



In Fig. 1 a situation is shown where there are three groups, the sets of 
observations (x, y) for these being drawn for convenience so that they fall clear of 
one another. 

The value of y that appears with any particular x in group t is regarded as 
a random deviate from a normal distribution whose mean is fitx+at and standard 
deviation <re, where «r«, ^nd Oi are independent of x and are thus characters of the 
whole group t Taking i^get&er all the values of x that can possibly occur in the group 

« B. S. PaMwm and S. S. Wilkt: Biom$$Hha, Yol. m. 1988, pif. B88~-78. 
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a number of means Aar + aj are obtained which all lie on the line Y ^ 4-a«. 

These population regression lines are shown in Fig. 1 (^ ■« 1, 2, 3). It should be 
noted that nothing has been said about the distribution of a?, which need not even 
be continuous. Thus the array means do not necessarily lie continuously along the 
regression line. It follows that if, in a particular problem, the assumption of 
equation ( 1 ) can be made legitimately for certain ic’s in the group but not for 
others, then conclusions drawn from the criteria obtained below will still be valid 
within the restricted area defined by the permissible xh. 

Our purpose is to apply the method of Neyrnan and Pearson to determine from 
the observed data criteria appropriate to test a variety of hypotheses regarding the 
group parameters and One such hypothesis may be illustrated by con- 

DIAGBAM REPRESENTING SAMPLES FOR CASE K-3. 



SCAL.E OF X 

Fig. 1. 


sidering the following example. An investigation into the heights of school- 
children in Glasgow indicated that within certain limits of age, the regression of 
height (y) on age {x) both for boys and for girls could be regarded as linear, and 
that for a given age the distribution of height could in each case be taken as 
normal with standai*d deviation the same for each age considered. Clearly here the 
distribution of x need not concern us. All that we need to know is the range of age 
within which the assumption 7 of linearity can be made. Assuming that such 
relations hold for a number of different groups of children (they may belong to 
different schools or social classes or races) we might ask whether the same 
regression line can be used to predict height from age for all groups, and also 
whether the variance about this regression line can be regarded as being the same 
throughout. This is in effect asking whether fit, and at are the same for all t. 
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Another situation which may be tested is shown in Fig. 1 where 0%t but 

4Zt is not constant. The regression lines here are parallel but hot coincident. 

The method employed below leads for each hypothesis considered to a differrat 
test criterion K The sampling distribution to which this criterion is referred is 
that which it would follow (were the hypothesis tested true) if in repeated san^ling 
the values of w were kept fixed, so that the only variation is that of y. It will be 
seen, however, that this sampling distribution of X is independent of the Thus, 
provided the hypothesis tested concerns only relations between crtt /3t, and at which 
do not contain w, the condition of fixed a!*B can be removed and the test of signifi- 
cance can be regarded as being made for any set of w'a subject only to equation (1) 
being satisfied. 


11. The Method. 


The observations in each group may be described as forming a sample 
Se » 1, 2, . . . k) of (a?, y). As stated above, this sample will be regarded rather 
as a sample of y, in which each y corresponds to a given a?. It will then be 
found convenient to speak of a population irt from which has been drawn, mean- 
ing by this that for each sxt in St the corresponding y is distributed normally about 
/Stic-j-at with standard deviation o-f wt is then specified by the parameters 
<Tt, /3t, and cct. 


The probability function for the joint occurrence of the k samples St is obtained 
by multiplying together expressions of the type given in equation (1). If (a?t<» yu) 
is the tth individual of the sample St, nt the number of individuals in that sample 

k 

and JV' «« S (nt), then this function may be written 


where 


, I 2 


• /8t»« ■ 




o-t* 




( 2 ), 

( 3 ). 


It will be convenient to denote by A, B, C, D the following conditions : 
A that included in equation (1), namely that 


B 

C 

D 


p (y I ic) . 


a?? 




V^TTO-t 

that <rt is the same for each group, i.e. 

that c-t ■* <r (^ ■« 1, 2, . . . Ar), 
that (e - 1, 2, ...A?), 

that Ot (t=*l, 2, ...A;). 


In addition it will be necessary to express algebraically the condition that the 
population means of the groups are collinear. If Wt and are the means for the 

xo— a 
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fit observations in the ^th group and the symbol [ ] is used to denote expected or 
population value, then 

= + (« — 1 , 

and the condition that the point {Bt, [yt\) lie on the line 

y ^ lx m 

is that ^tXt +at^l^+m « 1, 2, ... A:). 

This will be called condition £. 

Neyman and Pearson's method afiPords a simple rule for obtaining appropriate 
test criteria once two sets of conditions have been defined. These are : (a) the 
conditions which can be assumed to be satisfied, and (b) the conditions which define 
the hypothesis to be tested. For instance, suppose that in addition to the assump- 
tion implied in equation (1) we can assume that txt is the same for each group and 
we wish to test whether in addition all the are equal. Then, using the notation 
given above, the set of conditions (a) consists of A and B taken together and the 
set (6) of A, B, and C. The conditions (a) define a class Cl of admissible sets of 
populations tti, and the conditions (6) define a sub-class a>, of fl, to which the set wt 
must belong if the hypothesis tested be true. 

The maximum value of p (equation (2)), when the population parameters vary 
so that the set irt belongs to Cl, is called p (Cl max.). The maximum value when the 
set TTt is restricted to q> is called p (a> max.). The likelihood of the hypothesis has 
then been defined to be 

p (fi > max.) 
p (Cl max.) * 

The greater the value of X the more likely is it judged that the set ttj is 
restricted to <», i.e. that the hypothesis tested is true. The scale of X on which 
judgment is based is obtained by referring X to its sampling distribution when the 
hypothesis is true. 

Much of the value of this method of appro€kch lies in the fact that an essential 
first step in obtaining X is the definition of the classes Cl and this requirement 
encouraging clear thinking in the process of analysis. 

III. Types of Hypotheses Ck>NsiDSRED and Notation Used. 

The hypotheses that will be tested, regarding the regression of y upon x, ms,y 
be divided into three classes analogous to those which Neyman and Pearson 
denoted by JET, Hi, and H»*. Firstly there is the hypothesis H, that cr« is the same 
for every group, i.e. that the variance of y about the regression lines is the same 
throughout the whole range of data. This is the hypothesis that condition B above 
is true. Then there are several hyx>otheBe8 of the Hi type concerning the values of 
at and fit (conditions C, D or £), in testing which Hi is accepted as true. For each 

* J. Neyman and E. S. Pearson, **On the Problem of k Samples,’' BuUeHn de VAcadimie PolonaUe 

dea Sciencea et dea X^ttrea, S^rie A, 1981. 
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of these it is twod l^t the pre^e&t metdiod leads to one of R. A. Fidier’s tOstis in 
the analyeda of Tansiiioe. Finallj there are hypotheses of the ff type where nothing 
more is assumed than the condition A above and the hypothesis tested ^cmicems 
both oonditicm B and one or more of the oonditi<ms C, D, and £. The likelihood 
criteria' fbr this type of hypothesis and for the Hx hypothesis introduces into the 
analysis the geometric mean of sums of squares. 


In developing the tests the following notation will be used. ^ and are the 
means for the whole AT « S (n«) observations, S indicates summation for all 

observations (oin, yu) within the fth group, 2 indicates summation for all groups,, 
and ^-2 2. ^ 

t i 


ht 


2 («« — Wi) (ya — pt) 
2 (a?f< — ^e)* 

8 - ®o) iVt - yo) 


8 (wa^ St) (yti - yt) 

* 8 jmti — (yti go) 

Siwti^So)^ •/ 


.( 4 ) 



Fig. 8. 
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Thus ht is the regression coefficient in the ^th group, and 6* an average value 
obtained fipom within groups on the assumption that the k values of 6* do not differ 
significantly, hm is the coefficient obtained from the weighted mean of the groups, 
and is obtained from pooling together the whole N observationa ho may also be 
regarded as a weighted average of ha and hm* 

The corresponding regression lines are represented diagrammatically in Fig. 2, 
which shows also the parts into which — yo may be broken, as follows : 

fi = (y« - yd - ht (xti - » (pa - ^m) (ooti - ^t), 

fs “ — ^o), fi" *• (hm — (^H — ®o)» • (6) 

fs = (y« - yo) ~ 6m & * 6o (a7«i - 

The parts fi, fa, fs, f 4 (= f/ and fa can be readily shown to be ortho- 

gonal, the product of any pair summed over the whole N observations vanishing. 
Hence, for example, 

S (fx ft + fa + fa)* - ^(fi») + 5 (&«) + ^(f,») + (fa*) (6). 

IV. The Test for ff . 

As an example of a hypothesis of the H type, take the inclusive hypothesis 
that the populations irt have at mma-, A and Ot =* a * 1, 2, . . . A;), i.e. that there 
is no differentiation whatever between the form of regression in the different groups. 
Suppose we wish to test this hypothesis assuming only the form of the probability 
functions. 

n is defined by the condition A and p (fl max.) occurs when 

« ht ; at^yt- hxt ; Utat^ = S {{ya - yd “ (a^ti - 2 Pe))*. 

The conditions defining cd are A, B, C, and D. Thus to find p (a> max.) we write 
/3t ** /S, •* «, and cr* * cr in (2) and maximise with respect to a, /S, and a. This gives 

/3^ho; a^yo--hoXo\ Ar<7*=:iS^{(yei- yo)- 6o(^«-^)}* 

On substituting we find \ = p (« max.)/p (H max.), or using the more convenient 
criterion L 

L = ^ 1 L"«« 1_ (7), 

2^ S' {(y« - yo) — 6o {poti - ®o)}* 

i.e. the ratio of the weighted geometric mean of the variance about the regression 
lines ht to the total variance about a line fitted to the whole data. 

It will be seen from (6) and (6) that L may be written 
L 
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If the hypot^ema H is true, it is known ^ that 

+ 6i + ft -f w distributed as with JV — 2 degrees of freedom, 

S (fi*)/<r* is distributed as yf with — 2 degrees of froedom, 

and ^ (ft + fi + f4)“/^* distributed as 3^ with 2A? -> 2 degrees of freedom. 

It follows that L may be written in the form 


t W/ 2(xt*)+x,*' 


where the A; + 1 values of are independently distributed^. 

Following the lines of Neyman and Pearson’s proof for the case of a single 
variable it can be shown that thepth moment ooefficient of L about zero is 


Mr 


Np ^ ^ n 


fnt-2 


r(V) 


....( 8 ). 


If the number of observations in each group is the same, i.e. »« = n, then 


t^p 






J 


.( 8 '). 


. As in the case of a single variable, it is probable that the sampling distribution 
of L can here again be represented approximately by the Tjrpe I distribution 


p (i) _ I’ (^_± »?*) a - 


( 9 ). 


the constants being calculated as follows by the method outlined by the above two 
writersj : 

^. (1 -«;)(«•- >.7 (10,, 

where /ai' and are given by (8). 

The probability that L is leas than some specified value Xo may then be found 


♦ A general theorem from which it follows that these stuns of squares are distributed independently 
in X*, is proved by B. A. Fisher in Metron^ Vol. v. 1936. A proof following somewhat different lines is 
given by W. O. Cochran in Proe, Comb, Phil. Soe . , VoK xxx. Part «. 19S4. 

+ In the present case x#* and x«* have resj^tively tig - 2 1, 2, ... A) and 2* - 2 degrees of freedom : 

Ln Neyman and Pearson’s case the degrees of freedom were tig - 1 and fc - 1 respectively. Their proof will 
t>e found in pp. 468 — 471, Bi$UeHn ds VAc€ui4m4§ Polonaise dw Seieneta et dsM Ztettre*, S4rie A, 1981. 
t Loe. dt. and also in Bipm^triha, Vol. xxiv, p. 416. 
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from the Incomplete B^fimction Tables^. If only the 5 and 1 points of L are 
wanted, they may be obtained by interpolating into R. A. Fisher’s jr-tables with 

and then using the transformation X «= where the letters fx and^t are 
used to denote degrees of freedom in plooe of Fisher’s and 


V. The Test for Hx, 

The class 12 is now defined by condition A, and a> by conditions A and B, i.e the 
test is whether <ri is constant for all groups, p (O max.) occurs when 

at^yt- htXt ; - S {(y« — ) - ht {xti — ^e)}* 

p (to max.) occurs when 

at^yt — htXt ; Na^ » 8 {(ya - ye) - bt (x^ - xt)}\ 

On substituting and writing Lx *= we have 


;r ^ - bt (xti - ®e)}* 

i 


•( 11 ), 


^ S {(y« - y,) - ht (jcti - ««)}• 

i.e. the ratio of the weighted geometric mean of the variance about the regression 
lines ht to the weighted arithmetic mean of the same. Our only concern is now 
with the parts fx (Fig. 2), and when Hx is true Xx can be written in the form 




where each is distributed in the known form with tit — 2 degrees of freedom 

Ar). 

The moments of Lx can be found in the same way as those of Neyman and 
Pearson’s single variate Xi, which is of the same form but with We — 1 degrees of 
freedom for xt^- moment of Lx about zero is thus 


/ip - • 








•(12). 


and if the tiumber of observations in each group is the same (n* » w), then 


kPT 








■c 


n-2 


'’(^j 


.( 12 '). 


* Table* of the Incomplete Beta Function, edited bj Karl Pearson, Biometrika OlBoe, University College, 
London. 
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Afi before, a Type 1 diatribufdcm is used as an approximation and mi and eit are 
calculated by substitiiting firom (12) into <10). The Incomplete B-fune^on Taldes 
can then be used to give < £«> 

In the case of a single variate tables have been given for certain points of the 
distribution of la when the number of individuals in each sample is the same^. 
These tables may be used in the presmit test if they are entered with n — 1 instead 
of n. 


VL The Tests fob Hi. 

The essential preliminary condition fc»r testing hypotheses of the ffi tjrpe is that 
0*1 should be the same for each group. If this is satisfied, 6 of equation (8) can be 
written equal to ^/cr*, where 

(y« — fit 

Then S3. 

Let ^ (Cl min.) be the value of ffe obtained by minimising ^ for variation of the 
parameters such that the set of populations belongs to Cl, and let ^ (o min.) be 
similarly defined for Then for p (Q max.) we must have 

min.), 


and also 


i.e. 


Thus 


Similarly 


whence 


cUr 


0. 



1 4, 

1 

VVStt/ 


^+ij 


Her* 3= ^ (Q min.). 


p (Cl max.) 
p (q} max.) 1 



N N 

he * 

(0 (Q min.). 


1 1 

N ‘ N 

(m min.)! 

® > 


Lp( 12 max,)_ 


(Cl min.) 
^ (« min.) * 


or is the ratio of two minimum sums of squares. 

To take a particular instance, suppose that trt is the same for each group and it 
is proposed to test whether fit , the slope of the regression line, is the same within 
each group. The class is defined by conditions A and B, and ai by conditions A, 
B, and O. For Q, ^ may be split up as follows : 


^ (H) * {(Ph ““ pt) ““ ^ ®e)}* + fi K^# — fit) (^ — ^ {y< fit ^t ““ ®*}*» 


* P. O. Mfthalanobis, Sankhpa, The Indian JarnnoX pf StaHeHee, Yol. i. pp. 114 and 12S. Additional 
zather fuller tablet have been p4epitted for pnfallioation in the Department of Applied StatietiiM, Unircareity 
OoUege, London. 
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the sums of squares having N ~ 2A?, A?, and k degrees of freedom respectively. 
Similarly 

0 (<»)'= ^ [iyti — yt) — {oBti - + s [{ht — ba) (icti - ^«)}* 

+ S {(ba - (a7e< - ^«)}* 4* ^ {^t - fixt - ctf }», 

these sums of squares having N ~ 2k, k — 1, 1, and k degrees of freedom. 

Minimising in each case with respect to the parameters at our disposal, is 
seen to be equivalent to making the terms containing those parameters vanish. 
Thus 

7 « ^ = ^ liyti ~ Vt) - bt (xti - ^t)l* 

* 0 (tt> min.) S {( 2 /«i - yt) - bt (tvu - + S 1(6* — ba) (ana - «i)}» * 

When is the same for each group, follows exactly the distribution of 
equation (9) with mi = (N ~ 2k)/2 and mz = (k-- l)/2, and the Incomplete B-function 
Tables can be used to obtain the exact significance level. If we are interested only 
in the 6 % and 1 % levels we can ask if 8 (f#)* ** 8 {(6t — 6a) (octi — «t)}* significantly 
large compared with 8 (ft*) *= 8 [(ya — yt) — 6* (xti — ^t)}**. These two sums of squares 
are independent and the test is thus equivalent to one of R. A. Fisher’s tests in 
the analysis of variance. We may proceed in fact by calculating 



where i^i-^(ft*), t^.«^(ft*), 

y*i * A; — 1, and /a ^ — 2A;. 

The significance of z may then be tested by using R. A. Fisher’s tables. 

In the hypothesis tested, which maybe denoted by 2fa,i, we have been concerned 
only with the slope of the regression lines within the different groups, i.e. in testing 
whether ^t is constant. We may, however, be interested in testing a variety of 
other hypotheses regarding at and fit which are all of the type, i.e. for all of 
which conditions A and B of p. 147 are assumed satisfied. In each case certain further 
conditions are assumed and others define the hypothesis to be tested. The resulting 
criteria of the likelihood method are precisely those which flow from the applica«> 
tion of the analysis of covariance technique, but a brief indication in summarised 
form of the solution in the case of four further hypotheses may be of interest. 

Finding after applying the test no reason to doubt we may assume 

that fit is constant and test the further hypothesis that the regression lines for all 
groups are coincident. This is 

B*nd . If is not satisfied we may still test the hypothesis that 
the population mean y*s for each group lie on a straight line when plotted against 
the mean Two cases arise: 

(а) when it is not assumed that fit is constant, i.e. when is not true. 

(б) ; when it is assumed that is true. 

* The meaning ol the oompanson is brought out bj Fig. 2. We are asking if the parts are of 
slgnifloanoe having regard to the magnitude of the ^urts . 
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The h 3 rpotheBi 8 that the regression lines for all gronpe are ooincadeitt 
may also be reaohed aftor testing first and then i.e. starting fixmi the 
assumption that H%^ 4 , is true. 

Keferenoe should be made to Fig. 3, which shows diagrammatioally the arrange- 
ment of the group regression lines when the various hypotheses are true. For 
and M% 5 the regression lines should have been drawn to coincide but they have been 
side-stepped a little in order that the groups might be distinguished. The conditions 
determining the classes Q and cs for each teat are set out formally in the second 
and third columns of Table 1. 



For each set of conditions the sum of squares ^ /S^ {ye< — A can he 

split up in a different way. For instance, when condition E holds, ^ can in the 
first place be analysed into 

{{yn — (<*?*< ^e)}* + 8 {(6f — (ma — -f ^ «« — a*}*. 

The condition then permits* the last sum of squares to be written 
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and this breaks down further into 

8 {yt - Ixt - m}* *» fir {{yt - ^o) - h^(Wt - ^)}* 

4- fir {(6m — 0 (^« ~ ^o)}* + S {J^o ““ ^0 «i}* 

The algebra of the analysis of in other cases need not be set out here. The Lt 
criteria for the different Ift hypotheses have been shown to be given by 

L ^ ^ min.) 

min.)* 

and it is seen that in each case can be written in the form Va/(vi + v%), where 
and vg are independent sums of squares. The appropriate vi and vg and their 
respective degrees of freedom /i and /g are given in Table I *. 

The exact significance level of Xg can be found by using the Incomplete 
B-function Tables with and or alternatively, if we require only 6% 

and 1 % levels of significance, we can use z, where 



Something further should be added about the hypotheses and where 
the linearity of the means of the grouj^ is considered. They differ from the others 
in that the relation being tested concerns not only the constants cr^, /Se, and at but 
brings in also the mean or’s. We are asking whether condition E can be true, 
i.e. whether + m (< = 1, 2, . . . k). In the event of a positive answer 

being received to this question, it is not permissible to say that no matter what 
values Xt had been obtained the corresponding expected mean y*s would be linear. 
Indeed if this were so, then it is obvious that we must have I and at ^ m, and 
thus have a common regression line for the whole data. It is however permissible 
to draw conclusions about the linearity of the means for any other sets of x*h which 
lead to the same values xt^ provided only that they satisfy the initial condition A. 
For we saw that the relation ^ enabled us to break down ^ in 

such a way that it was possible to say that {(ye — yo) — 6m (®« -- ®o)}* was 

distributed as with f^vmk^% and this takes no account of the way in which 
Xt is made up. It follows that the distribution of Xg, when condition E is satisfied, is 
independent of the partitioning of Xf Just as when testing relations which concern 
only <re> Pt* fuid ae, it is possible to start by considering a fixed set of a?’s and then 
to remove this restriction, so in the present case owing to the nature of the 
distribution of the criterion we are able to extend our conclusions to any set of «*s, 
subject only to the condition that 2^^ (f — 1, 2, . .. k) does not change, and that condi- 
tion A remains satisfied. 

^ That V, and vg in each case are diatribnted aa follows from the theorem already referred to. It 

will be found helpful to compare the sums of squares required for the different tests with the parts 
etc. shown in Fig. 2. A convenient form in which to calculate the sums of squares was set out by 
B. 8. Pearson, in Appendix 1 to B. H. Wilsdon's paper, Supplemmit to Journal of Royal Statiwtieal Society y 
Vol. s. p. 180. The notation there used is the same as that of this paper. 
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If the question at issue is not whether the means yt are linear but simply 
whether they differ significantly, then instead of condition E we shall have to test 
whether 1, 2, ... h). The sum of squares ^ 

become 

S (5t - m)* « (ye -*■ yo)* + (yo - m)* 

and the Lt criteria will be modified accordingly, S (ye — yo)* with /i = ft — 1 being 
used instead of S (ye — yo) — — ^o)}® with yi « ft — 2. The restriction on the a?’s 

is the same as before. 


VII. The Hypotheses H. 


Only one test of the H variety was treated in § IV. Without going into much 
detail, it is of interest to note that other hypotheses of this type can be con- 
sidered in much the same fashion as were the hypotheses H%, The criterion L 
each time takes the same form : 


/.= 








< 13 ). 


Xq is the only part that changes, and as it does the moments of equation (8) 
will be modified. In fact, if f is the number of degrees of freedom of Xq* then the 
moments of L about zero, where there are n observations in each group, will be 




APT 



PC") i 


,(14). 


The complete hypothesis considered above was a particular case where f was 
2ft — 2. As one further instance of an H test, suppose that no other assumption 
than condition A is made and we wish to test whether conditions B and C hold, 
i.e. whether at — a and = This differs from the test in that constancy 
of at is not now assumed, but is one of the elements to be tested. The L criterion 
is seen to be 


n 






— 2 {(yw (®M — ^01* 

We i 


^ s Ky« - Vt) - K (<Bu - ®t)}* 


Putting L into the form (13), we have 

^ ^ {(f>t — ^<)}* 

which has ft — 1 degrees of freedom. The moments of L follow by putting/»ft — 1 
in (14), and then the 5 % and 1 % levels may be obtained by the method outlined 
on p. 152. 
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Vin. Illustrative Example. 

The data which wiU used to illustrate the tests described above was col- 
lected at the Building Research Station. From the point of view of the Ht tests, 
it has already been discussed by Mr B. H. Wilsdon. The problem concerns the 
relationship between crushing strength of concrete and mortar, and the particular 
data considered here are those on which his Table I (<?)• is based* Corresponding 
tests were performed on dry mortar (a?) and normal concrete (y) made from eight 
different cements (w = 8, i»l, ... 8), and the results fall into four groups (^=*4, 
*.. 4) corresponding to four different ages of mor^r and concrete. The 
relevant information is summarised in Table II. 

First it may be asked whether the regression of concrete strength upon mortar 
strength can be represented by the same regression line for all four ages, and also 
at the same time whether the variance about that regression line is the same for 
each age. This is testing the complete hypothesis if. To calculate Z, insert into 

(7) the values from Col. (5) of Table II, i.e. for 2 {(y« — y«) — bt («?«— £^)}* 2, 3, 4), 

« 

88, 644, 1609 and 1772 respectively, and for S {(y« — yo) — 6o 12637. It 

is found that Z = *2008. 

From (8) the moments of Z are 

Mean * *7038874, * 0128115, i.e. a « *1132. 

Following the method given for approximating to the 5 % and 1 % levels of Z, 
we have from (10) 

9*967092, m*« 4*19297, /*« 19*9342, /i«i 8*3859, 

whence interpolating in R. A. Fisher’s x-tables, 

j *4435, = *6302, 

tz.aa « *4947, Z.oi « -4026. 

Z is clearly significantly small and the composite hypothesis must be rejected. 

We may now investigate more closely where the heterogeneity lies. To test only 
whether the variances are equal, i.e. the hypothesis Hi, the appropriate criterion 
is that of equation (11). Substituting the vadues from Table II, we have Zi** *6168. 

The lower 5 % and 1 % levels of Zi may be obtained from the tables already 
referred to, or otherwise directly from (12) we have 

Mean « *8798690, « 0078003, i.e. <r « *08832, 

giving twi =a 11*0437, m** 1*5080. 

* Loc. cit, p. 184. The figuses given in the present paper are not exactly the same as those in 
Mr Wilsdon’a table owing to the fact that the original hgures as they were given to me were later altered. 
The disorepanoies do not materially affect the oonclnsions reached. 



160 Problems in the Analysis of Regression among h Samples 

Instead of proceeding by calculating fi and and interpolating into the 
^'tables, the exact significance level of Lx may be found by evaluating 

J5^«8 (110437, 1-6080). 

We obtain the chance that Lx < -6168 to be *0126. It is thus unlikely that at is 
constant for all ages. 

Since the variances may differ there is doubtful justification in proceeding 
to test the hypotheses iSTa, but the results are given in Table III for the sake 

of illustration*, z in each case is J log^ » where vx and Vg are given in Table I 

and may be calculated from the figures of Table II. 


TABLE III. Summary of Hx testa. 


Hypothesis 

z 

fi 

A 

^•05 

*•01 

Significance of z 


Negative 

3 

24 

•6608 

*7757 

Not significant 


1-3866 

3 

27 

•6427 

•7631 

Significant 


•6718 

2 

27 

-6051 

•8513 

Doubtful 

St., 

1-7686 

1 

29 

•7165 

1-0139 

1 Significant 


These tests would allow us to accept the equality of the regression coefficients 
for each g^oup and perhaps the linearity of the means. The equality of the within 
groups regression coefficient {fi) and the regression coefficient of the means (Z), 
i.e. the hypothesis of a common regression line for the whole data, would have to 
be rejected. 

In conclusion I should like to express my thanks to Dr E. S. Pearson for 
suggesting this problem to me and for the help he has g^ven me at every stage 
of the work. 

* The reader may be referred to a sampling experiment carried out by E. S. Pearson in the case of 
a single variable, which suggested that the test was not affected even by quite large changes in trt from 
group to group. See BiofMtrika^ Vol. xxi. p. 846, 1929. 








ON AN IMPORTANT CLASS OF STATISTICAL HYPOTHESES- 

By STANISLAW KOLODZIEJCZYK. 

(Prom the Biometric Laboratory, Nencki Institute, Soc. Sci. Lit. Varsoviensis, 

Warsaw.) 


1. Introouctoby. 

Dr J. Nbyman suggested to me as a toj>ic for my thesis for a doctor’s degree 
a problem which may be stated as Allows. 

Assume that it is known that each of the n independent variates 

: ( 1 ) 

follows the Gaussian law of frequency, 

- (<g< - 

<“>■ 

with a common standard deviation, o-. The mean values, ?n<, of the variates (1) are 
connected with s unknown parameters 

®r+l» • • • Ct# ^ ^) (3) 

by means of the formulae 

97»i«cCi<aiH-Ca<aa + ... + (t »• 1, 2, ... n) ...(4), 


where the coefficients are known and form a non-singular matrix 






,( 6 ). 


The problem to be solved consists in testing the statistical h 3 rpothesi 8 , that 
r known linear and independent functions of the ^parameters (3) have some definite 
given values, for instance that 


hiiai H- bvOm 4* . . . 4* »= for » 1, 2, . . . r < a (6), 

where the coefficients b and the constont terms £9 are known numbers. 


It will be convenient to call a statistical hypothesis of the above tjrpe a linear 
hypothesis. 

Different aspects of the problem considered have been already treated by many 
authors, as by A. A. Markoffd),/* Student” ( 2 ), B. A Fisher ( 8 , 4 ), J. Neyman and 
B. S. Pearson (S, S) and by J. Neyman (7). The points of view of these authors were 
different. I shall treat the problem in its general form following the lines indicated 
by Neyman and Pearson (S, t and lO). It will be seen that some of the oalculations 
1 shall have to perfomi and some of the results I shall obtain are identical with 
those already published the authors quoted above. 

Bfomatrito xsvsi 
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2. Examples. 

Before I proceed to the solution of the problem it will perhaps be useful to 
illustrate its generality. 

Example 1. Suppose that 1, and that the equations (4) reduce to the following: 

(is=s 1,2, ... n) (7). 

In this case the hypothesis, H\ reduces itself to that of “Student,** namely that 

Oi — ai® (8). 

In fact the variates (1) may be considered as forming a sample for one unknown 


univariate normal population, tt, and the h 3 rpotheBis to test is that the mean of 
the population tt is equal to ai®. 

Example 2. Suppose that s — 2, while the equations (4) are 

mi = ax for i 1,2, ... r (9), 

and mj = a^ for J^r+l, r4*2, ...n (16). 

The hypothesis, that 

= = 0 ( 11 ) 

(corresponding to the equations (6)) may now be formulated as follows. The variates 
(1) may be considered as forming two samples, 

... ( 12 ) 

and Xr^Xy (13), 


from two normal univariate populations tti and 7r2, with common unknown variance 
and the hypothesis tested consists in the assumption that the means of the 
populations tti and 7r2 are equal. This is what we may call the generalised hypothesis 
of “Student.** 

Example 3. Assume s > 2, 

n = ni + WBH- + (?i£>l) (14), 

and suppose that the equations (4) are 

m( = ai for t = 1, 2, ... Wi, 

and ni^«=aa for / = wi + 1, -H 2, ... ni 4- r? 2 , 

(16). 

ib-l k 

generally mf « a* for 2 < t ^ S n^, 

where 1, 2 , ^ 

Consider the hypothesis, which may be expressed by means of s — 1 equations 
of the form (6): 

ajc^ a,^ JB*® = 0 for ^*=1,2, ...s — 1 (16). 

It will be easily seen that the variates (1) may now be considered a6 forming s 
samples from a normal univariate populations tti, 7 r 2 , ... w,, with a common standard 
deviation or. The hypothesis to test is, that the means of all the populations are 
equal. This is the case considered by Neyman and Pearson (hypothesis H% in their 
notation) and also by R. A. Fisher in his analysis of variance. However it will be 
seen that the statement of the problem and the point of view of the solution will 
be different from those of Fisher. 
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Ewamph 4. Put « 2 and assume the equations (4) to be of the form 

na^ax^ a,y< (t « 1, 2, . . . n) (17), 

where the numbers yi are known. 

We may now consider three important hypotheses: 

Hx, assuming that ( 18 ), 

JSTt, assuming that a, « ot® (19), 

JSTs, assuming that ai + ci*F«Z (20), 

where ai®, a*®, X and F are given numbers. 


The hypotheses Hx^ H% and are connected with certain problems of correlation 
theory. Consider a population w of individuals with characters x and y and suppose 
that for each value of y the character x is normally distributed about its mean, say, 

my^Ot^oxy ( 21 ), 

with the standard deviation <r, independent of the value of y. The population tt 
may be considered as a set of populations, say iTy. each of them being determined 
by the value of the character y, common to all its elements. The variates (1) may 
now be considered as a sample drawn from the population tt. If besides the values 
of (1) we know also the corresponding values of the character y, 

yi» yi> ••• yn * (2^)> 

where some of the y's may be equal, but at least two of them should be different, 
then we may consider each value of d? as belonging to an individual, drawn at 
random from the partial population tt^. 

The hypothesis Hx assumes that the constant term in the regression equation 
(17) has a certain definite value. This hypothesis, in its application to some problems 
of agricultural experimentation, has been considered by J. Neyman(ll). 

The hypothesis Ht concerns the value of the regression coefficient of x on y. 
Its test is to be found in the book by R. A. Fisher (l«). * 

Finally the hypothesis has been dealt with by H. Schultz (18) and — ^in a 
more general way — by Miss K. Iwaszkiewicz and concerns the error of the ordinate 
of a fitted regression line. The result obtained by K. Iwaszkiewicz is published — 
with a suitable reference — in a paper by J. Neyman (7) showing its importance to 
some agricultural problems*. 

The hypotheses considered in Example 4 may be easily generalised for the cases 
when the regression of x on y is not linear. This hypothesis in its general form is 


treated in Fisher’s book (18). 

Example 5. Put s « 2r and assume 

n«ni + nt+ ... -fnr (»'>1) (28). 

The equations (4) will be written 

for S njt < % < S njt » 1, 2, . . , r) (24). 

k^l k»l 

^ The method of teetins the hypothesis Hg is given in g 8. 
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In applications we have sometimes to consider the hypothesis B\ expressed by 
means of r ~ 1 equations 

and also the hy^thesis H%y that 

— a*,..! « = 0 

Oik-atr^ “ 0 

These hypotheses may be expressed in statistical terms as follows. Consider r 
populations tti, ... tt^ of individuals having some characters a and y. Suppose 
that the regression of w on y in each population is known to be linear. Suppose 
further that, whatever the population TTi and whatever the specified value of y, the 
variance of w in the corresponding y-array is known to have the same (though 
unknown) value The hypothesis Hi consists in the assumption that the 
coefficients of regression of a? on y in all r populations have equal values. The 
hypothesis H^ is more stringent and assumes that not only the coefficients of 
regression but also the constant terms in the regression equations have a common 
value in all populations tti, tti, ... Wr, thus that the regression lines of a? on y are 
in all populations identical. 

As far as I know these hypotheses were first considered by Miss K. Iwaszkie- 
wicz(i4), who showed their importance in agricultural experimentation and in some 
industrial research problems. 

Further examples could be easily given, but I think that those which have been 
described above, together with those given in §§ 6 and 8, are sufficient to show 
the generality of the problem. 

It is here assumed that, before applying the results of this paper to any group 
of experimental data, a test has been applied to show that the assumption of the 
standard deviation of the a?'s being constant is actually satisfied *, 

3. Simplification of the Problem. 

Before I proceed to the solution of the problem, I shall reduce it to a simpler 
form. 

It may be shown that the problem of testing the hypothesis H, expressed by r 
independent linear equations with regard to the parameters a with known 
coefficients, can be reduced to that of testing the hypothesis, say J?i, ascribing to r 
of the parameters a some definite values. In fact, the equations (6) being independent, 
we may solve them with regard to some r parameters a. the order in which 
the parameters are numbered is of no importance, we may assume that the solution 
gives 

dig =» dig (-Bi®, Bj®, • . . BfP , df.^1 , ^ 94 . 2 , • . . Gtf) (A? « 1 , 2 , ... r) . . .(27). 

* This must either have been previously ascertained or else it must be done on the data themselves. 
In the case of bivariate populations this certainly would require that there should be enough observations 
of the *’8 with the same y to determine with reasonable accuracy the array variance. 
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StibBtitatmg the above values of (k ■« 1, 2 , ... r) into the equations (4X we 
get similar linear equataons, say* 

+ +c«<' a« (^8), 

with known coefficients e\ Now it may be assumed that the means tn< are linear 
functions of the ffirm (28) of s parameters 


••• Srr ^+if (W-hti* ••• ....(29) 


and that the hypothesis H determines the values of r of these parameters, namely, 

(/b-l. 2,...r). 


Thus if we are able to test any hypothesis Nj, we shall be able to test also the 
hypothesis of the type H. 


4. Solution based on the Pbinoiple of Likelihood. 


I shall now give a solution of the problem of testing the hypothesis following 
the lines indicated by J. Neyman and £. S. Pearson (S), that is to say the solution 
based on the principle of likelihood (in the sense of the word used by Neyman 
and Pearson — not by R. A. Fisher), This is as follows. The first stage consists in 
stating CX, or the set of admissible simple hypotheses. 

We shall include in the set any hypothesis, ascribing to the unknown 
parameters a, ai, Os, ... a« any definite values, with the only restriction that a >0. 

The second stage consists in defining the set co of simple hypotheses belonging 
to the composite hypothesis. Hi , to be tested. 

The set to will contain any hypothesis which may be described by 

••• f ®r+l» ••• ^ ( 80 ), 


where the a/^a are specified by the hypothesis tested. Hi, while ar+i, Or+s, ... a, and 
a > 0 are arbitrary. 

Further proceedings are as follows. We suf^ose that the observations provide 
us with values of the variates xx, 00 %, We have to calculate the simultaneous 
frequency function of these variates, determined by a simple hypothesis contained in 
O. Let itbe ...arn). Obviously p (a?x» .-.-«») will depend upon the values 

of parameters Oi and c. We keep the x*b constant, and choose out of the set O the 
simple hypothesis for which p (xx, Xu , ... ^n) is maximum. This absolute maximum 
ofp (xx, ^, . . . Xn) will be denoted by pa* Next we keep again x*b constant,and choose 
out of the hypotheses included in to the one for which p (xx, x%, ,,, Xn) is maximum. 
This will be a relative maximum and will be denoted by p*,. 

According to the principle mentioned, the test of the hypothesis Hx should be 
based on the values of the likelihood (in Ibe sense of Neyman and Pearson), namely. 


X* 


& 

Pa 


(81). 


If for any set of bbeervational data X is small, this means that the set of 
admissible hypotheses Contains some of them, from the point of view of which the 
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facts observed are much more probable than from the point of view of the hypo- 
thesis tested. In these circumstances we are intuitively inclined to reject the 
tested hypothesis. In order to judge whether the value of X is ‘‘small” or “large,” 
we consider the probability P of rejecting unjustly the hypothesis tested, that is 
to say the probability of rejecting H\, when it is true. In most practical cases we 
may agree upon the value of P which may be considered as “small.” For instance, 
we may accept the view that it will do no great harm if we reject the hypothesis 
unjustly not ofbener in the long run than say once in a hundred times. This 
arbitrarily chosen value of the probability of an error in rejecting the hypothesis Hx 
has been termed by Neyman the confidence coefficient €. The choice of c deter- 
mines the limit between “ small” and “ large ” values of X. Denote by 

P{(x<Xo)jiri} 

the probability, determined by the hypothesis PTi, that the observations will give 
us the value of X<Xo, where X© is any number between zero and unity. Choose 
now Xo, so that we have 

P((X<Xo)|^i}=»e (32), 

and consider it as a limit between the values of X “small” and “large.” Obviously, 
if we agree to reject the hyjiothesis tested when X^X©, the probability of rejecting 
Hx unjustly will be 

P = </> (Pi) P {(X < X©) I Pil < P ((X ^ X©) I Pi] « € (33), 

where (Pi) < 1 denotes the unknown probability a priori of the hypothesis Pi. 

Thus the test of the hypothesis Pi requires the knowledge of the probability 

P {(X < X©) I Pi], 

or of the frequency distribution of X, determined by the hypothesis tested. Pi . 

Having thus described the problem, we shall proceed to its solution. 

The simultaneous frequency function of the variates (1) will be 


p{xxy X2 




S 

i*i 

2<ra 


The process of maximising (34) with regard to the Oi and <r is simpler, if we try 
to maximise logp (a?i , a-g, • • • «?n) instead of (34). Taking into account the equations 
(4) connecting the w*s with the parameters a, we may write 




0 


(35), 


d log p 15 / 

~daf ” - o^a,) cji « 0 


(36), 


n n 

where x" =* “ .2 («< - CiiUi - o*iaa - . . . - c^a,)* (37). 


Obviously the equations (36) are those which we should get if we wanted to 
minimise The possibility of their solution is guaranteed by the condition con- 
cerning the matrix (5). 



BtAJHISLAW KOhOtmBJi3ZYK 


i6r 


Denote by nS^* the minimum value of (87). This will be obtained solving 
(36) with regmd to the a*B. Denote by 

tti, «!, ... a, (88) 

the solutions of (86). It is ecusdly seen that each a is a linear function of the 
nSa* is obtained from (87) by substituting a’s instead of the a"s. The equation (36) 
will now reduce to the following : 

(89). 

It is easy to prove that by substituting the a*s instead of the a% and 8^ instead 
of <r in (34), we actually get the maximum value of this function. It will be 

<“)■ 

The process of finding is quite similar and we get, say, 

: 

where n(Sa^ + stands for the minimum value of the sura of squares (S7), 
calculated under conditions that 

ai — oi® as a* Os®, ... Oy « (42), 

while other parameters remain arbitrary. 

The likelihood is now easily found : 



We see that the value of \ depends upon the value of, say, 

■^-1 <«)■ 

Thus instead of considering \ as the appropriate test and rejecting the hypo- 
thesis when \<Xo, we may use Z or and reject the hypothesis Hi whenever 
Z'^Zq, where Zq must be so chosen that 

P{(Z^Zo)\Hi]^€ (46). 

To get the expression of the left-hand side of (45) we have to start from (34), 
where instead of the m*6 we have to put their expressions in terms of the a*B 
appropriate to the hypothesis Hi, 

Simple calculations show that there exists the identity 

X* *= .2 (a?< — m<)* a* nSa^ + nSo* (46), 


where nSo* » - Oi®) Ct* («• - a*®) 4- . . . + (a, - a,®)]* (47), 

while n8a* « S («?< — ouai — — ... — (48). 

* This formula shows that the condition stated in (8) on p. 161, that the number $ of parameters 
should be smaller than the number n of variables, is essential. In fact, if it were szsn, then 8 ^ would 
be aero and so would X. Thus any result of the observation would lead to the rejection of the hypothesis 
if the test were upon the principle of likelihood. 
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It is to be noticed that SI is entirely independent of a,+i, ar+%, ... a,. Thns 


we may write 

®1. ... ®n)“ ( 7 ^-) « ^ (^®)* 

Vo* 

Now introduce new variates^ say, 

Ei, Et, ... E„ y,+i, y,+a, ... y„ (60), 

connected with the ay’s by means of the formulae 

soi » 4* C\iE\ 4- Ca; J^2 ”1" ... 4* <6# 4- 4- ... 4 Cn^yn (51 )» 

where c« for A? < « are the given coefficients of the formulae (4) and for k>8 are so 
chosen that 

ft 

S ChiCji = 0 if A; and j> s (52), 

and fory>« (53). 


It is easily seen that it is always possible to find coefficients satisfying the 
above conditions, provided the matrix (5) is not singular. 

In order to obtain the simultaneous frequency function of the new variables 
(50), we have to substitute (61) into (49) and to multiply the result into the 
Jacobian of the transformation. This Jacobian, say G, is certainly not zero. In 
fact it is easily seen that owing to the special choice of 0,4.*,**, satisfying (52) and 
(53), the square is equal to the square of the matrix (5) and thus is positive, 
not zero. 

Owing to (4) the equation (51) may be written in the form 

oci = Cu {Ex 4 ai°) 4 c^i {E 2 4 a*®) 4 . . . 4 {Eg 4 a,®) 4 Cg^x^i 4 ... 4 Cniyn 

(54). 

Multiplying each of these equations into c^(^‘=sl, 2, ... s) and summing, we 
see that the terms in y’s are cancelling, owing to (52), and further that the sums 
Ej^ + ajc are satisfying the same linear equations in terms of the x*s as the a's, 


namely the equations (36). We thus have the identities 

forifc=l, 2, ... r (65), 

and for A; « r 4 1, r 4 2, ... 5 (56). 

Using this we have 

— Cuax — CtiUa — ... — 4 ... 4 Cnt^n (57), 

Squaring (57) and summing we get, owing to (48), (52) and (53), 

nSI - 2 y** (68), 

n 

while nSo® — 2^ (ci< iS'i 4 Ck 4 . . . 4 CgiEg)* (69). 
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3[f we 'sabutitiite (51) into (S4) and multiply the result by we get 

p(JE\t E% Egt VMt ••• yn)*< 5 on 8 t. x e ^ (50), 


and it is easily seen that, owing to the fact that Sa^ is dependent only upon y and 
8^^ only upon E's^ the variates y« are independent of the E% and besides are 
independent among themselves. The expression nSa^ being a sum of n ^ s in^- 
dependent normal variates, it is known that the distribution of 8^ follows the law 


p (8a) « const. X 8a^^^ e (61). 

It follows that the simultaneous distribution of 8a, Ei, Et, ... is given by 

Sa* + St^ 

p(8a, E\t E%t ... £«) M const. x8a^^^s ^ ..(62). 

Further on we shall use this result. 


Our problem being to find the distribution of 8i,/Sa, we have now to consider 
the process of calculating 8^. 

I have alresuiy pointed out that n8a* is entirely independent of the a*s. Thus 
when calculating the minimum value of the sum x* (^) with regard to the 
arbitraiy parameters Ur+i, Or+i, ... a„ we are really trying to minimise the sum 
of squares nSo^ in (47), where the a's and the a<®’s are to be considered constant 
and the ar+i^'s are arbitraiy. Thus 8^^ will be the minimum of So** It will be a 
positive definite quadratic form in r variates Ei, Eo, ••• Er* As these have been 
found to be independent of the y's and thus of 5 *, 8t is also independent of 8a, 
and, as is easily seen, follows the distribution 

* n8j^ 

p (Sb) =* const. X 8b^^ e (63). 

Therefore the simultaneous distribution of and 8b, is given by the product 

p (Sa , 8b) « const. X Sb^^ e ^ (64). 

Introducing instead of 8b the new variate 

8b^8aZ (66), 

^nS2l±Il 

we get p (8a, Z) « const x e * gr-i ( 50 ^ 

Now on integrating with regard to 8a within the limits 0 and oo , we get 

n-(6~r) 

p (Z) «« const X Z^^ (1 -f Z*) ^ (67), 

and the problem may be considered as solved. The tests obtained are those in- 
vented and applied in many cases by R. A. Fisher. 

The expresmons 

are considered by him as independent estimates of the common variance. 



170 


On an Important CUtM of Staiistietd Hypotheses 


The method of testing is reduced to the calculation of 

* - i log = log ^ + i log ^ (69), 

and to referring to the table of values of z corresponding to the confidence ” 
coefficient €«*06 and € = ’01, given by R. A. Fisher, where Wi and n% correspond 
to my r and n. — « respectively. 

Another method of course would be to substitute in (67), say, 

+ (70), 


and in obtaining 


p(w) = 



1 


(l-u)® 


1 


and in testing whether 
where t/o is such as to have 


(1 Wo 



(71) , 

(72) , 

(73) . 


The value of may be found from the Incomplete Beta-Function Tables*. 

Of course if r = 1, then the distribution of Z reduces itself to that of “ Student's” t. 

It is interesting that the tests of R. A. Fisher are consequences of the principle 
of likelihood, stated above. 

I think it will be useful to finish this section by summarising the mechanical 
process of testing any hypothesis of the type H and in illustrating it by one 
example. 

The steps in testing a hypothesis H are as follows : 

1. State the set of admissible hypotheses H and test whether each of them is 
describing the distribution of any variate (i = 1, 2, ... -a) by means of the formula 


piXi) 


2(r» 


(74), 


or V2ir 

where a is independent of t, and mi may be represented by a linear function of 
certain 8<n unknown parameters a. 


2. Express the hypothesis H tested by means of independent linear equations 
(let r be their number) with regard to the parameters a. If this is possible, then : 

3. Calculate the minimum value of 


X* (^i — (75) 

which it may have for any set of values of the parameters a, corresponding to 
admissible hypotheses contained in H. This minimum value will be nSa*. Divide 
the minimum obtained by the number of degrees of freedom, equal to w, the 


* K. Pearson, Tablet of the Ineon^lete Beta Function. Biometrika Office, 1984, 
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total number of observations, minus s, the number of parameters a. The ratio 
will be 


(76). 


4. Calculate the minimum value that x* have for values of the para^ 
meters a, satisfying the r linear equations, expressing the hypothesis to be 
tested. The minimum will be Subtract obtained previously, 

and divide by the number of degrees of freedom, equal to r, the number of 
independent equations expressing H, The result will be 




,(77). 


5. Calculate z, according to (69), and refer to Fiaher^s tables. 


5. Example: The Pboblem proposed by K. Iwaszkiewicz. 

Obviously the above five steps must be divided into two parts. Everything 
up to and including the determination of formulae giving Rod Si,* may be 
regarded as theoretical work. The practical test will then consist of numerical 
calculations leading to z, for which different formulae simplifying the arithmetic 
may be given. 

The example I shall consider is that of the problem proposed by Miss K. 
Iwaszkiewicz, to test the hypothesis H whether the regression coefHcients of ^ on y 
in k different populations, from which we have samples, are equal. Suppose that 
the sample, 2<, from the ith population, «•<, consists of n< individuals, for which the 
values of x and y are respectively 

ya, 

We consider each population as being subdivided into partial popula- 
tions, say 7r<y, according to the values of y, appropriate to the individuals of tt^^. 
Each individual of the sample Xi may be considered as being randomly drawn from 
a separate population 7r<y. 

We consider as given that the regression of a; on y in each population Wi is 
linear. This is equivalent to the assumption that the mean, niiy, of the population 
TTip is equal to 

miy-aa+ (79), 

where Oa and are some unknown parameters, the number of which is s » 

If there is evidence that the variances of the within single partial popula- 
tions iTiy have equal values we may proceed further. 

The hjqpothesis to be tested may be expressed in 

(80) 

independent linear equations in terms of the parameters a, namely, 

oa-atM^O (i- 1, 2,...&- 1) 


(81). 
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We consider the sum of squares, which may be written 


« I 2 (ary - oa - (82). 

The process of minimising this sum is easy, and we find 

I wccr*,*(l-.r,)* (83), 


where cTasi and r* mean respectively the standard deviation of ar's and the coefficient 
of correlation between x and y in the sample 2j. 

* Now we shall have to assume that the parameters a satisfy the equations by 
which the hypothesis H is expressed. Denote by oa the common valub of a^. 
Obviously nSc? -h nS^ will be the minimum of the sum, say, 

Xi® = 2 2 (ary — oa ~ uayy)* (84), 

i3=l JXSl 

with regard to all possible values of the parameters a% and a<i» ibr 
After easy algebra we have 

n = 2 ^ (86), 

2 ntay* 

i=l 


where a-y^ means the standard deviation of y*s in 
gives us the result 

k-l k 




2 

k 


2 


ruay^ 


2f. Some further easy algebra 
(a< - aj)* 

( 86 ). 


where ai=ri-^ (87) 

^Vi 

means the regression coefficient of a? on y in the sample 2,*. The above formula is 
certainly not the most convenient one for calculation, but it shows the connection 
of the test with the differences in the regressions within single samples. The 
formulae which are the most convenient for calculation are, as usual, the less 
elegant ones, and may be easily deduced directly from the process of minimising 
(82) and (84). 


6. The Method of Testing the Hypothesis H, based on the Best 

Critical Begions. 

We shall now treat the same problem from the point of view of the theoiy of 
Best Critical Regions If we consider the data of the observations 

xx, a?*, ...a?n (88) 

as co-ordinates of a point 2 (the sample point”) in the n-dimensioned space (the 
sample space W), then any method of testing the hypothesis AT consists in a rule 

* Neyman and Pearson, ** On the Problem of the most effiolent Tests of Statistical Hypotheses.’* 
(See reference 9.) 
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of rejecting whenever the sample point lies within a certain region, mj w, in the 
sample space. The region w will be called the critical region ; the supplementaiy 
region the region of acceptance* If the hypothesis H is a composite one 

to be tested, thus leaving one or more parameters determining the simultaneous 
distribution of the a!*B unspecified, then the appropriate critical region w shcmld be 
** similar’* to the sample space with regard to these unspecified parameters. I shall 
now explain this concept having in view the particular hypothesis H, considered 
in previous sections* 

The hypothesis JI does not specify the parameters 

^+tf ••• ,.(89). 

We shall say that a region fv is similar to W with regard to these parameters 
and that it has the power c, if the probability, say P{(2ew)l jfiTj* determined by 
any simple hypothesis belonging to that the sample point S will fall within w is 
equal to e, whatever he the values of the parameters (H9), unspecified by H, 

In order to define the best critical region, we have to consider an alternative 
simple hypothesis, say ht* We shall say that wq is the best critical region with 
regard to having the power €, if 

(1) Wq is similar to W with regard to (89) and has the power €, and 

(2) if the probability determined by the alternative hypothesis hi that the 
sample point S will be contained in is larger than (or at least equal to) the same 
probability corresponding to any other region, w, satisfying (1). 

We see that the process of finding the best critical regions must be preceded 
by finding the regions similar to TT. 

Neyman and Pearson gave the method of determining the most general regions 
similar to the sample space, the application of which is possible under the following 
conditions : 

(a) The simultaneous probability function p[{Ki^ ...a;n)|ifl should possess 
the derivatives of all orders with regard to all parameters unspecified by H for every 
set of values of osi, ,,, 

(b) If a is a parameter unspecified by the hypothesis, then denoting 

««). 

we should have, say, 

(91), 

where A and B are independent of the ^s. 

(o) The hypothesis H does not specify the values of s — r + 1 parameters. 
Assume that these parameters are written in any order and form a series, say, 

-••^4— r-J-l ....*..(92). 

* The notation need in thin formula, Z«i£>, means t the point 2 is an element of ip. It is generally 
used in tbe theory of sets, ^he letter « standing here for **element'* should not be confused with « 
denoting tiie power of the Teflon w. 
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Denote by the function defined as in (90), 

<“'■ 

and consider the hypersurfaces 

» (7^ « const. 0* 2,...s — r+ 1)...(94). 

Fix the values of the parameters a and O and consider the intersection of the 
t — 1 hypersurfaces (94) corresponding to j ■» 1, 2, ... t — 1. This intersection will 
be denoted by 8(ai\ Ci, C^, ... Ci_i). Keep the values of a*s constant and change 
the 0*8 in an arbitrary way. Doing so we shall obtain a family of intersections 
8{ai] Oi, Ot, ... This family will be denoted by F {at). The condition (c) 

requires that it should be possible to find such an order of parameters unspecified 
by the hypothesis tested, that each family F(ai), for i = 2, 3, 1, should 

be independent of a,-. This condition means that given any two different values 
of cci, say a/ and any surface Sia/; Ci, ... Ci^i) included in the family F (a/) 
should be included also in F{ai*), corresponding perhaps to some values of the C*s 
different from those to which it corresponded in jP(a/). 

If the above conditions (a), (6) and (c) are satisfied, then the method of finding 
the similar regions may be applied and the best critical region is constructed in the 
following way. 

Consider the intersection of « — r + 1 surfaces (94) corresponding to fixed 
values of parameters (compatible with the hypothesis tested H) and to some fixed 
values of the constants C, This will be denoted by W (Ci, (72, ... Cg^r+i)- Denote 
by wo(Cx, (^ 2 , ... Cg^r-^x) the part of the hypersurface W ((7i, 0 %^ ... defined 

by the inequality 

p{ia)i, Xi , ...»»)! All «•*, ... *(Ci, O',. ... <7._r+i) ...(95), 

where k{Ci, <7*, ... 0,^r+i) must be found for any system of the C’b so as to satisfy 
the condition 


[[...[ p{(xi, ...Xn)\H]dw(Ci,...C,^+l) 

JJ J r+i) 

fi"' fwrir n p{(a!i, — a>n}\S]dW(Ci....C^-r+i) (96). 

JJ J tv {Oi, ,,, 

The best critical region wo is then built up of regions i^^o((7i, ... (7#-r+i) 
corresponding to all possible values of constants (7i, Ci, ... Cg^^x* 

We have now to consider whether the probability function ... a?«)| JEf } 

satisfies the conditions (a), (6) and (c). 

Ad (a). The condition (a) is obviously satisfied. 

Ad (&). Turning to our previous notation concerning the parameters, we shall 
have 


logp {(«!, «,,...«„) I JET} 


i — n log V 27r — n log o- ■ 


n 

^2 (Xi — Cliax—... — CriOr — Cr+X,<<lr+1 — ... — O^a,'^ 


2<r* 


( 07 ). 
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^ ^ 2^ («, - ^2 0 j<a^)» (98), 

.*.(99). 

Thus <f>^ satisfies the equation (91). Consider now, say, 

4>,i 2 («,- i etiat)cji ( 100 ). 

aaj ^*4ml tml 

Obviously --p 2 ^ 0 / (101), 

and satisfies (91) as it is independent of the ^r’s, ^ ^ assumed to be zero. 

Ad (c). The h 3 rpothesis (c) is also satisfied. To show this we write the para- 
meters, which are not specifi^ by the hypothesis IT, in the following order: 

CCi =s» Oa (lr +29 • • • ®#— r — O' (102). 

For every — r the hypersurface fif(a<; Ci, is the inter- 

section of the hyperplanes, the equations of which are obtained by equating ( 100 ) 
to arbitrary constants, thus equivalent to the following equations: 

S CjiXi ^ Cj ( J « r -4- 1, r -4- 2, . . . r -I - 1 — - 1 ). , ,(103). 

These equations being independent of » ar+i» so is the family F{a^, Now 
put r+1. The hypersurface ^(a,-r+i; is now the inter- 

section of hyperplanes (103) for « r + 1 , r 4- 2 , ... s, which are independent of 
«= O’, This means that F(a,-.r+i) is independent of « a. 

Thus all conditions which are sufficient for the applicability of the Neyman- 
Pearson method are satisfied and we may proceed to the construction of the best 
critical region Wq. 

Consider then hypersurfiewses formed by the intersection of 

h 3 rp 6 rplanes (103) for 7 « 1 , 2 , ... s — r and of the hypersphere 

<f>tr ** (104). 

According to the general theory the best critical region with regard to the 
alternative hypothesis hx consists of parts «;(Ci, ... CV^+i) of the hypersurfaces 
W (Cx , ... C^#-r+i) determined by the inequality 

p{(Xx, x$, ... <»n)|M >p{(^> *•' Oen)\H] fc(Ci, C7i, ... (7,-r+l) ...(105), 

where A?(Ci, Ci, ... O^^^x) must be chosen so as to satisfy the equation 

^ p{(a*i,...«w)|ir}<iTr(<7,....C,_^i) (106) 

iiJ w (C/j, ... 
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on eveiy hypersurface TV’(Ci, corresponding to every possible system of 
values of the constants Ci, ... 

The solution is somewhat simpler if we introduce new variables instead of (88). 


Denote by 

, ••• » ®r+li ••• ^ (^^7) 

the values of parameters corresponding to the hypothesis iST, and by 

al^ 02', ... ttr+i, ••• O' ( 108 ) 


those corresponding to the alternative The best critical region and the integral 
over this region from p{(a?i, ... «?n)l H\ being independent of the values of the last 
« — r -h 1 parameters (107), we may assume that their values are the same as 
in (108). 

Now instead of the x*s we introduce new variates 

Exy E 2 , ••• Eg, ••• Vn (109), 

connected with the x'b by means of the formulae (54). Instead of considering the 
probability function p{{xx, ..,x^\H] we shall have to consider the function (60), 
namely, 


p{(Exy E ^, ... Eg, ... yn)\H] =const. e 2cr* ...(60 his), 

n 

where nSa^ = 2 y** (58 bis), 

and nSo^ = 2 (cxi E\ + ck E% + . . . Cgi Eg)^ .(59 bis), 

»-i 


Having regard to (55) and (56) we conclude that the transformation used will 
lead to the following formula: 

pliEx, E^,,., Eg, yi,+*, ... yn) 1 M "* x « " ...(110), 

where 8(^ is the same as in (60 his) but where, instead of given by (59 bis), we 
shall have 8^ determined by the formula 

n^Sii* « 2 {cxiEx 4- E% + ... 4- Ch E^ 4- Cr+i ^r+i 4- ... 4- ^«)* ...(111), 

if ***' E^^Eu-su (fc«l, 2, ...f) (112) 

and c* *= a*' — a*® (^ « 1, 2, . . . r) (113). 

We shall make another linear transformation, introducing instead of ^r+it 
,,, Eg new variables, say ^r+i, ... which will transform n8f^ into 

n8f?^ 2 Ri^EiE^^ 2 (114). 

i-r+l 

The same transformation will give us 

n^i** 2 RuEi^E/ ^ 2 


(115). 
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It will be seen that the quadratic form 

£ti JSij ^ nSf^ .......( 116 ) 

is positive and definite and represents the minimum value of nSo^ with regard to 
all possible i^stems of values of ar44i, a«. 

The coefficients may be obtained from the coefficients as follows*. 
Denote by 

n 

— 2 Oitcjt^ga ( 117 ) 

and consider the determinant, say, 

\ gn 9it ••• 9u 

gti gn • • • gu 

g»i g^ ' • • g— 

Denote further by the minor of 6?, corresponding to and let 

( 119 )- 


* This is an application of the following general theorem concerning the linear transformations, which 


we shall have to use again below. 

H J W 

is a positive definite form and the functions 

+ ( 2 i = l, 2, ... res) (2) 

are independent, then it is always possible to find another « - r linear function, say, 

= + - (^ = r + l, r + 2, ... s) (8), 


such that together with (8) they form an independent system. Moreover, if the system of equations (2) 
and (8) is solved with regard to the j?’s and the solutions are substituted into (1) then we get 

’Jt" 

The coefficients in (4) may be calculated from the ooofficients pyy in (1) and in (2). Consider 
the determinant 

a= I 0„ ... Ou I <*)• 

ifn On On 


* Omi On ••• On I 

and denote by Gyy the elements of its reciprocal, say, G“*. That is to say, Gyy will bo equal to the minor 
of O corresponding to pyy, divided into G. Consider further the determinant, say. 



where **»=„*, <’*■ 

Kow will be the element of the determinant T-i, reciprocal to T. In other words is equal 
to the minor of T, corresponding to divided into T. 

Biometrika zxvn 


12 
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Consider finally the determinant, say. 


R- 


^11 ^la 
^^ai ^aa 




.( 120 ), 


Grl Ort ... Gr 
and let its minor corresponding to Then we shall have 


R 


.( 121 ). 


The probability functions of the new variables 


V^r+lf ''^r+a» * * * y»4-l» y«-f-a> . • • Ifn * • • (122), 

determined by the two hypotheses H and hi will have the previous forms (60 his) 
and (110) respectively, with the exception that the expressions of wSo* and nS^ 
will now be (114) and (115). 

We shall now transform to new variables the equations (103) and (104). It is 
easily seen that the equations (103) are equivalent to the following ones: 

= const. ( j = r + 1 , r + 2. . . . s) . . .(123) ; 

where should have the form (59 6i«), and where 


(j==l,2,...6) (124). 

But we may also use the form (114). The relationships between the -^*8 and 
the E's may be written 

yfrj — (y * r + 1, r + 2, ... «) ...(125). 

If we substitute these formulae into (114), we shall get identically (59 his^y 

7iSo^ - S Rij i f i hjt eX (126). 

The equations (103) transformed to the new variables may now be obtained by 
differentiating (126) with regard to ctj (/ — r H- 1, r •+• 2, ... 5 ), and keeping in view 
(124) and (125). We have 

^ i = »' + ^.»' + 2, ...» (127). 

Thus the new form of the equations (103) will be 

0 --- + 1.- + 2.....) (128). 

Cj being an arbitrary constant. The equations (128) depend only upon i^’s. They 
may be solved with regard to the ^*9 and thus replaced by an equivalent system 

•^r+l V^r+* = ^r+2, • • • (129), 

where the D*s are again arbitrary constants. 
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Turn now to equation (104). This is equivalent to 


n (8a^ + 480 *) const (130). 

Owing to equations (114) it may be replaced by 

n(^«»+5o*)-^«-Do (131X 

Do being an arbitrary constant. 


The problem of finding tbe best critical region is now reduced to that of 
determining the hypersurface, say w (Do, Dr+i, Dr+t» ••• Dg), which is a part of the 


intersection of (129) and (131) on which 

P {(Ex, ... Er, ... yn)|M 

^ P {(^ 1 > ya-¥X» ••• yn)!*^} ^ (Don Dr^\, ... Z)*) (132), 

k being subject to the condition of satisfying (106). Taking the logarithms of both 
sides of (132) we shall get 

- 9 ^. [«'»»*+ 2 RiiE^Ei'-\- 2 ^,*1 

L V «1 #«r+l J 

> o ^ EijEiEj-\- S ^ (133), 

^ -tfCT- L v=»i t^r+l J 

and after some reduction 

i (PJiEj E/E/) > k<T^ » kx (say) (134). 

*i-=i 

Owing to the relation Ei == Ei — €i (136), 

we have EiE^’- Ei Ef — €iEj Ei -- ete^ (136). 

Thus (134) may be written in an equivalent form : 

2 S Bij€iEj— S Bij€i€j'^kx (137), 

H—X 

or ^ RiAeiEi'^ko (say) (138). 


The region w (Do, •“ is the part of the intersection of (129) and (131), 
say W (Do, Dr+x> ••• Dg), determined by the inequality (138), where ko should be 
so chosen that the integral of the probability function corresponding to H over 
w (Do, Dr^xi Dg) should be equal to e multiplied into the integral of the same func- 
tion over W (Do, Dr+Xi ••• Dg), It will be seen that the problem of determining 
w(Do,Dr^Xf ... Dg) is that of finding such a value k^, that the relative probability 
(determined by H) of (138), given (129) and (131), may be equal to e: 

P{(h>^2)\(H), (yjro^Do), (^^+i-Z>^+x),...(^.-i>,)}«€ (139). 

This last problem would be easily solved if we could determine the simultaneous 
frequency function, say, p (fx, yfro, ^|rr+x , ... ^•) of variates fx. ^r+i, ... 


As all these variates depend only upon 

Sat Exf Eo, Eg (140), 

we may start with the respective frequency function given by the formula (62), 

piSa.Ex.E,,... - const. X (141). 

12—2 
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where nS^ has the form (69 W«). Introducing ••• variables, 

by means of the formulae (125) we get p (iSo, IPi, ... i?,. '^r+i. ••• V^*). having the 
same form as (141) except that nfSo* has now the expression (114), 

_„s«*+s»*+s.* 

p(8a, Hi, ... Er,‘>lfr+ 1 ,--- -ir,) = const. X Sa’'~’~'^e ^ ...(142), 

where niSr.*= 2 (143). 

Our next step will consist in changing the system of variables. 

Say El, ... Er, V^r+l » • • • » 

into the new one, (144), 

by means of the formulae 

= X RijEj for t == 1, 2, . . . r 

for t or = r + 1, r + 2, . . . s .(145). 

yjto ~ nSa^ + nSf,^ > 


Tt is easily seen that 


X Gj^ylrpylr^ (146). 


The Jacobian of the transformation being 

^ • • « Ery^ llr^i , yj r^) 

0 (^ 1 , ... yjr^^yf'o) 


(Sa, Ely ••• Er, '^r+l$ ••• ( 147 ) 


ti ~ 3 - 2 _ 

we find p {yfroy'^iy = const. X (yfro — nSb^) 2 g 


.(148). 


The last step in this series of transformations will consist in introducing fi, a 
new variable, and also in representing nSb^ in (146) as a sum of squares. This will 
bo done by applying again the theorem of algebra I have given in the footnote on 
p. 177. 

It will be seen that substituting (145) in (188) we get 


Si^XeiylTi (149). 

i=i 

Applying the theorem just mentioned we may add to this equation another 


7 — 1 of them, 

Si^X (t = 2, 3, ... r) (150), 

such that they will form a system of independent equations, and if used to transform 
(146) to the new variables fi, ... fr this will give, say, 

niSV - ^11* ft" + S (161), 

where Zir^ ==X (152). 
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The Jacobian of the tranaformation of ••• V't into the new syatem 

fi . f*. ••• fr. V'r+i. . • ■ ■^0 

being (^vioualy constant and different from zero, we get 

n-8 -2 ^ -f my 

(153). 


The required probability function of ••• is now easily found by 

integrating for (i ■* 2, 3, ... r) within the extreme limits of these variates, compatible 
with the fixed values of the others. The extreme limits for fr will be, say, 

— * 2/r < fr < + Xr,. (154), 


if (155). 

Putting fr == Xr w we shall get 

/ +Xr 

p(fl» • • fr» * const. 

n ~ — 1 ^0 n -» * — 2 

X - ^11*, fi* - ^ e~ p\l - w*) ^ da (166), 

where the integral with regard to u may be included in the constant factor. 
Obviously the process of integration may be repeated till we are left with 

n -8 -t- r ~ 8 _ ^0 f- ny 

p(fi» ••• '«/^«) = const. X (*^0 — ^ e ...(157). 

Denoting the left-hand side of (139) for short by P(ArB), we shall now get 


.(155). 


n - 4» - 2 

V 2 


du (166), 




rZ 

I i> (fi . “^o. • • • V^.) 



/*+Z ‘ 


.(158), 


where — X and + X are the extreme limits of variation of fi compatible with the 
fixed value of namely, _ 

(159)- 

Equating (158) to e and performing the easy algebra, we get 


u-«+r-8 


n - 8 + r - 3 


f (1^0 - Zii* Si*) ^ « r (^0 - Zn* fi») ^ rffx ...(160). 

J J -Z 


Putting fi=Xf;, k%^LvQ 

V being a new variable, we reduce equation (160) to the following 

n-ff+r-S -8 

f (l-t»*) ® d»-«f (l-s») ds-eSfL- 

J Dto 


.(161), 


n-ff + r-S 

_ 


r+1 


1 »— s+r— 1 
2 ’ 2 


( 162 ). 
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From this equation v© may be found, and it appears to be an absolute constant, 
depending only upon e. Taking into account (161), (169), (152) and the last of 
formulae (145) we get 

= Vo \/ S Rij€i€j V nSa^ + nSi? (1 63). 

The inequality (138) determining the region w{Do, Dr^i , ... D,) may be written 
in the form 


2 Hij€iEj 




+ 7 £ 


^ Vo 


.(164). 


As this inequality is independent of the values Z>o, ••• Ds, we see that it 

determines the best critical region Wq, provided v© is found to satisfy (162). 


7. Is THERE A Single Best Critical Region wrTH Regard to the Whole 
Glass of Alternative Hypotheses ? 

Having thus found the general solution of the problem of the best critical 
region of the hypothesis H with regard to a chosen alternative hi, we have now 
to consider whether this region is changed if instead of the hypothesis hx we take 
into consideration some other alternative simple hypothesis, say ^ 2 - This question 
is very important for the following reasons. It has been shown by Neyman and 
Pearson that the tests based upon the best critical regions could not be bettered 
even if the a jiriori probability law of all unknown parameters were known. There- 
fore, in cases where for every admissible alternative hypothesis h the best critical 
region is the same, the problem of testing the hypothesis H may be considered as 
entirely solved : no better test can be found. If, however, the best critical regions 
with regard to different alternative hypotheses are different, then there is no 
“best” test of the hypothesis H, Whatever be the test, it could be bettered if the 
a ^priori probability law were known. There are intuitive reasons for believing 
that in these cases a “good” test is provided by the principle of likelihood. 

The best critical region with regard to the simple alternative hypothesis hi 
being defined by the inequality (164), we have to consider if and under what 
conditions this inequality is independent of hi. 

It will be perhaps useful to recall the significance of all the symbols in (164) : 


(165) 

denotes the minimum of the sum of squares 

2 { fCi-mif = i (a?i~ i (166), 

J=rl 

calculated under the assumptions (1) that the a;’s are constant, (2) that the para- 
meters 

a, = a,» 0 -1, 2 ,... r) (167) 
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have the values specified by the hypothesis tested, and (3) that the remaining 
parameters Oj for r + 2, ...s are arbitrary. It is seen that (165) does not 

depend upon the alternative hypothesis. Symbols for 1, 2, ... r are also 
independent of the hypothesis h^. According to formula (116), 

^ S RijBliEj (168), 

and the R'b are the coefficients in the expression of nS^ in terms of the E*b and 
may be calculated in terms of the given coefficients c^. The meaning of the E*b is 
given by the formula (66), i.e. 

(^ = 1, 2, ... r) (169), 

where a* is a linear function of the a?’s giving the minimum value of the sum (166) 
if substituted for a|., for 1, 2, ... s. Obviously Ek for A=l, 2, ... s does not 
depend upon the alternative hypothesis Aj. The only symbols depending upon Ai 
in (164) are the e's. In fact accoitiing to (113), 

(^-l, 2,...r) ..(170), 

where is the value of the parameter specified by H, and a<' that specified 
by the alternative hypothesis Ai. 

We see therefore that the question whether the best critical region may be 
independent of Ai reduces to the question whether by some transformation the ratio 


S Rij €i Ej 



may be made independent of the e’s. This appears to be possible only if r * 1. In 
fact the numerator of (171) being a linear function of the e's, we may hope to be 
able to cancel them only if the denominator were a rational polynomial in €*s. 
Now comparing the expression under the square root in (171) with (168), we see 
that it is a special value of the positive definite quadratic form corresponding to 

Ei^€i 1 - 1 , 2 , ...r (172), 

and thus may be represented by a sum of at least r squares. Therefore the de- 
nominator in (171) is rational only if r — 1. We have thus the following interesting 
general result : 

If the hypothesis H considered in this paper is either, 

(1) specifying the values of more than one of the parameters ai (i » 1, 2, . . . s), or 

(2) specifying the values of more than one linear function of the same parameters, 
tlven there is no common best critical region with regard to all alternatives Ai. 

It will be seen that there will be no common best critical region for instance 
when we test the following hypotheses : 

(1) that the means of A > 2 sampled populations are equal ; 

(2) that the regression coefficients in A > 2 sampled populations are equal ; 
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(3) that the reprression lines in two (or more) sampled populations are identical. 
In fact, even if we compare only two regression lines, the hypothesis that they are 
identical can only be expressed by r * 2 equations. 

In all these cases the best test available seems to be that provided by the 
principle of likelihood, discussed in § 4. 


Let us now turn to the case r = 1 and consider whether there is a common 
best critical region with regard to all alternative hypotheses. In this case the 
ratio (171) reduces to 




(173), 


and the inequality defining the best critical region (164), to 


v^^ii ej ^ ^ Hii E\ €\ Vq 

^/nSa^ +RnEi^ ^ 's/nSa " Vl— Wo* 


,(174). 


It will be seen that in the most general case there will be no common best 
critical region even in this case. In fact, if 

ei = ai'--.aiO (175) 

may be for some admissible hypotheses positive, and for some others negative, 
then in the first case the best critical region will be defined by the inequality 




(176), 


and in the second case by the inequality 


(177)l 


In many cases, however, we may assume that the class of alternative hypotheses 
consists only of such hypotheses which ascribe a definite sign to the expression 

(175) . In such cases there is a common best critical region, determined either by 

(176) or by (177). 


As the left-hand side of the inequalities (173) and (174) may be written in the 
form 


V nSt 

s/ vSa* + n/Sfc* 


(178), 


we see that the test may be based on the value of the ratio 

.Q. 


Sa 


.(179), 


which, as was mentioned in § 4, follows “Student's'* probability law. In this case 
the best critical region is determined by the inequality 


Z 


l«il 




(180), 
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vrhere Mo should be found from the table of “Student’s” integral to satisfy the 
condition 


r<‘ 


n- • + 1 


a dz 




n- g-f 1 

) ~ a dz — i 


.( 181 ). 


We may combine this with the results of Markoff, who proved that the expression, 
say, 


a 


n — s jRi 


.(182), 


is the estimate of the standard error squared of the variate ai. Taking into account 
(169), we may now define the best critical region by means of the inequality 

(«! — ai®)|^>i;o v'n — (183). 

The above result may be used in two ways. Sometimes we may know the 
expression for Then the best critical region may be found directly from 

(183). In other cases we do not know the value of and it may be desirable to 
calculate it for some purpose. Then we may do so by calculating nSa^ or the 
minimum of the sum of squares 

(184), 

considering all s parameters a as variable. Next we minimise the same sum with 
regard only to s — 1 parameters, whilst one of them, ax ui®, is kept constant. The 
result is n + ^e,*). Substituting, we obtain 

w/Sffr* « Bxi (ax - ai®)* (186). 

As ax must have been previously found in the process of getting nSa^, we may 
now calculate and substitute it into (182) which will give us <ra^* This process 
will be illustrated below. 


8. Further Examples. 

In this section I will give two further examples of linear hypotheses. In the 
first of them there exists a common best critical region with regard to all alternative 
hypotheses and in the other there does not. 

Example 1. Denote by x and y the possible yields per acre given by some two 
varieties of cereals A and B. These yields depend largely upon the conditions of 
growth. If the two varieties are sown under similar conditions on adjacent plots, 
then both of them are similarly influenced and the yields thus obtained must be 
correlated. I shall suppose that experiments carried out in different experimental 


stations and possibly in different years gave the results 

Mean **= x, s.D. «* (186), 

yi , yt, . . . Mean = y, s.D. « cr^ (187). 


I shall further assume as known that the regression of y on a? is linear*. The 

* This has been found in many oases studied by M. Gorsfci and K. Iwasakiewioc. See BoezrUki Nauk 
Bolniczyeh i LeSnyeh, t. xxvxn. pp. 211 et aeq, and t. xxxi. pp. 277 et seq. Bee also W. Staniszkis, 
**Eigelmis8e dreljftbriger (1927 — 29) vergleiohender Versuohe mit Leinsorten,’* Wyniki Doswiadezen 
Polowyeh, No. d, Warsaw, 1984. 
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hypothesis, H, to be tested consists in the assumption that in conditions where the 
variety A is able to give a specified yield of X tons per acre, the average yield of 
the variety B does not exceed the value F, which is also specified. Usually it will 
probably be assumed that Y = X. 

If after having applied the test we reject the hypothesis jET, this will be equi- 
valent to the assumption that the experimental data give us sufficient evidence 
that in conditions when the yield of A is X, the variety B is better. 

Let us now present the hypothesis H under the form of a linear hypothesis. 
We start by noticing that the assumption about the linearity of regression of y on 
X means that if we fix any value of x and consider the population, say ttoj, of all 
possible i/’s, then the mean of this population, say will be connected with the 
fixed value of x by means of the formula 

= ai d- (1 88), 

where ax and are independent of x. This equation, which must hold for any 
value of X, and therefore for any value of this variable given in the sample, is 
equivalent to my equation (4). 

The role of the variates (1) is now played by the yields of the variety By denoted 
by y. The hypothesis to test is expressed by the equation 


'aix ~ d" Oj^X «= Y ( 189 ), 

while the alternative hypotheses are 

ss: cfj -4- 02 X > (190). 


The e(j nation (189) corresponds to the equation (6) of the theory. If there is 
evidence that the standard deviation in each array of y is independent of Xy then 
it is easily seen that the hypothesis considered here is a linear hypothesis. The 
number of unknown parameters s =» 2. These are the coefficients ax and 02 of the 
regression equation (188). The hypothesis itself is expressed by means of only one 
equation (189). Therefore ?• = 1. As the alternatives satisfy the inequality (190), 
we conclude that there is a common best critical region with regard to all alter- 
natives. It may bo defined by the inequality (183). In our case ai® means the 
hypothetical value Y of the ordinate mx of the regression line. ai is its value as 
obtained by method of least squares, 

d- r ^ (X — ^) (191), 

CTx 

where r means the correlation coefficient between (18t>) and (187). ei will be the 
difference between the value of the ordinate nix ^ specified by the alternative 
hypotheses and that specified by the hypothesis tested. Thus | | = ei > 0. The 

value of may be calculated by the formula of Miss K. Iwaszkiewicz (see 
Neyman(7)) or again it may be directly calculated following the method described 
in the last paragraphs of the preceding section. For the sake of illustration we 
shall follow this latter method. 
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We consider the sum of squares 

(192) 

and find its minimum with regard to ai and a%. This is known to be 

n8a^ ^ nay^ (1 - r*) (198). 


Next we assume that ai and Os fulfil the equality (189), substitute in (192) 


ai *= Y'-^ogX (194), 

and minimise, say, nx^ = 2 {y< — F— a# (ir< — X)j* (195) 

with regard to og. Easy algebra gives the result: 


» m.' + W - « { V ( t - H) + ;,rf (s' + 

(196). 


Comparing (196) with (193), (185), and (191), we find 


R 


n<rx 


Now (182) gives 




V(l-r»)/, . (X-sy 


• n — 2 \ (Tic 

and finally the best critical region is determined by the inequality 


.(197). 

(198) 


y + r^(X-®)- F>^oVn-2<r., (199), 

where Zo satisfies the condition (181). 


Example 2. The hypothesis about the Imparity of regression. Suppose that the 
observations gave us n pairs of values of two variables x and y. These pairs are 
grouped according to different values of a;, which are 

xiy x%y ... Wg (200). 

The values of y which correspond to the common value of x, say Xiy will be de- 
noted by 

ya, y< 2 , ... yint. Mean«y<, 2 (201). 

f ~i 

I shall suppose that it is known that the standard deviation of the population 
of all y*s corresponding to a fixed value of x is independent of the value of x and 
that the distribution of y’s in the arrays is normal. The hypothesis, H, to test is 
that the regression of y on a? is linear. 

We shall start by showing that H is what I call a linear hypothesis. Denote 
by the population mean of ys corresponding to a fixed value of x. That is to 
say, fn» will be the mean of an array of y's in the population sampled. The hypo- 
thesis H to be tested is expressed by the equation 

» .4 -f- Bx (202), 

where A and B are some constant numbers and x has any value. This equation 
may be considered as corresponding to the equations (6) in the theory. In order 
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to write the equations corresponding to (4s) we shall denote by the population 
mean of the variate y,v. As this corresponds to the value of a, we shall sum up 
our knowledge concerning niij by writing 

ni^ — mi (203) 

for any ^‘== 1, 2, ... n*- and i »» 1, 2, ... a Thus we see that the population means 
of all variables y are linearly expressed in terms of 8 unknown parameters ??i<. 
Here we see that a test of the hypothesis H is possible only if there is at least 
one array in which we have at least two observations. In fact, if for any i we had 
w, « 1, then we should have s — n, contrary to the assumption of the theory that 
the number of parameters is less than the number of observations. 

Now, as the role of the unknown parameters is played by the w^'s, we shall 
have to transform the equation (202) in order to bring it into correspondence 
with the equations (0), which are linear with regard to parameters and then contain 
only known numbers b and B, Substituting in (202) oo — iVx, x — ac^ and x^Xi 
for 1=3, 4, . . . s, we get 

vii — A — Bxi = 0 \ 

mx — A — Ba\ = ol ( 204 ), 

7/^2 ^ A — Bx^ = 0 j 

a system of linear equation which allows the elimination of A and B. The result 
is given by 

mi , 1 , Xi — 0, for 7 = 3, 4, ... ,8 ... (205), 

mi, 1, Xi 
7ff2f 1, X2 

or mi (x2 — xi) 4 - (xi — X2) -b m2 (xi — Xi) = 0, for 7 3, 4, . . . . . .(206). 

Now we sec another condition which the observations must satisfy to allow the 
test of the hypothesis //. The number of the arrays, or 8, must be at least equal 
to 3. Otherwise the equations (206), which express the hypothesis //, could not 
have been written. If ^ 3, then the number, r, of the equations expressing the 
hypothesis H is r = s — 2. 

It is easily seen that in the case considered there is no common best critical 
region with regard to all alternatives. In fact, even if « = 3 and thus r = 1, the 
class of alternative hypotheses would certainly include those which ascribe a value 


of 7^3 such that 

7773 — iTi) > — mi (av — fl?a) — Ws (a:i — (207), 

and also those for which 

7/18 (^8 — ^ 1 ) < ~ mi (xi- xt) — m2(xi^ Xi) (208). 


Therefore the test to be applied is that following from the principle of likelihood, 
defined by (72) or by 



( 209 ), 
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where «<• must be found from the Tables of the Incomplete Beta Fonotion so as to 
satisfy the condition 




i) 


,( 210 ). 


Now £ia* and Si,* are found as follows. We consider the sum of squares 


» S S (y<; - mt)* (211), 

Jml 

and minimise it with regard to all m,- for 1, 2, This gives us 

nS,*^ i f (y«-y,)* (212). 

< 0*1 j^l 


Next we assume that the hypothesis tested is true and minimise (211) with 
regard to what is not specified by the hypothesis. For this purpose we could use 
the equations (206) and consider mi and mt as being not specified by H. But 
obviously we may use instead the expression 

7ni = A Baci for t = 1, 2, . . . s (21 3), 

and consider A and B as numbers which are not specified by H, The minimum of 
(211) calculated under the conditions (213) is then, as it is well known, 


w (^V 4- ^6*) = n V ( I - r*) (214), 

where ay means the standard deviation of all y*8 in the sample, and r the corre- 
lation between x and y in the sample. Thus 

« m 

nSt* = ntr„* (1 - r*) - 2 {yij — Vi)* (216), 

and the critical region is determined by the inequality, say, 


u 


^ ^ {yij-Vif 



nay^ (1 — r®) 


< Mo 


(216). 


9. Summary of Results. 

1. The problem considered in the present paper consists in testing a composite 

hypothesis which may be described as a “ linear hypothesis concerning the 
means of n normal variates with common standard deviation.*’ It is supposed that 
the mean, m^, of each of the n normal variates, Xi^ having the same unknown 
standard deviation cr, are linear functions with known coefficients of 8 parameters 
ai (% 1, 2, . . . 8). The values of parameters are not known and the hypothesis H to 

be tested consists in the assumption, either (1) that some r < ^ of these parameters 
have some specified values, say ai^, ... a,.^, the remaining parameters being 
unspecified, or (2) that some independent linear functions, St, of these para- 
meters with known coefficients have some definite values, say 0%^, . . . 0r^. 

2. It was possible to give a general solution of the above problem, based upon 
the principle of likelihood, proposed by J. Neyman and E. S. Pearson. 
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3. A general expression has been given for the best critical region for the 
hypothesis H with regard to any simple alternative hypothesis hx, 

4. It has been shown that in cases where the- number r of the parameters a< 
(or of their linear functions) which are specified by the hypothesis H exceeds 
one, then there is no best critical region common to all possible alternative hypo- 
theses. In those cases therefore the method of testing the hypothesis H must be 
based upon the principle of likelihood — unless some other "‘good'’ test is invented. 

6. If r = 1 and if tlie class of admissible hypotheses is limited to those ascribing 
to the parameter, say ax , or to the linear function, say Ox (which are specified by H) 
values which are larger (or smaller) than the values specified by H, then there 
exists a best critical region, common to the whole class of the alternatives. In these 
cases the solution obtained of the problem of testing H could not be bettered even 
if the probability law d piriori of all unknown parameters were known. 
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EIN BEITRAG ZU UNTERSUCHUNGEN UEBER ZWEl- 
DIMENSIONALE VERTEILUNGEN VON MAS8ENPUNKTEN 
BEI ZUFALL8ARTIG BEDINGTEN BEWEGUNGEN*. 

Von henry MAHLMANN (Hamburg). 

Einlbitung. 

In dem Gnindproblem der vorliegenden Arbeit handelt cs sich um die 
Bestimmung einer Verteilungsdichte, wenn von einem Flachenelement mit dem 
inneren Punkt Q — dem Quellpunkt — ausgehend N Massenpunkte sich Uber eine 
Ebene in folgender Weise verteilen: Durch Q mogen m verschiedene Geracjen 
Gti O 2 . Gs, Gtn gehen, die rn Bewegungsrichtungen reprasentieren. Auf jeder 
dieser Geraden gelte ein Wahrscheinlichkeitsgesetz Vr(^r)» das ist die Wahrschein- 
lichkeitsdichte ftir die Lange einer Bewegung auf Gr oder parallel dazu. Vr (Sr) dsr 
ist also die Wahrscheinlichkeit, um auf einer Parallelen zu Gr von einem als 
Anfangspuiikt genommonen Punkt Pr-i bis zu einem Punkt Pr zu kommen, der 
(von ausgemessen) zwischen Sr und Sr-^dsr liegt, dabei ist Sr irgendein 

Parameter, der eineindeutig den Punkten der Geraden Gr zugeordnet ist. Ein 
Massenpunkt Mi bewege sich nun zunachst von Q ausgehend langs Gi gemfiss der 
dortigen Verteilungsfunktion ?;i (^ 1 ) bis zu einem Pi, darauf von Pi ausgehend 
parallel G 2 bis zu einem Punkt Pa geinass Va (sa) usw. Allgemein bewege sich Mi 
von Pr-i ausgehend parallel Gr bis zu einem Punkt Pr gemass der auf Gr geltenden 
Wahrscheinlichkeitsdichte Vr {fir)- Nach m solchen Schritten, deren Reihefolge 
beliebig ist, da es sich um einfache Vektoraddition handelt, befindet sich Mi an 
einem Punkt Pm = P- Her analoge Vorgang hnde ftir alle M Massenpunkte statt. 
Gefragt ist dann nach der Dichte <I>i (P), d.h. nach der Anzahl der Massenpunkte 
pro Flacheneinheit an einem Punkt P der Ebene und zwar bei grossem m und M, 

Es sind dabei alle m Richtungen als gleich wahrscheinlich angenommen; ist 
das jedoch nicht der Fall, so lasst sich dieser auf den obigen zurtickftihren, indem 
man die Vr (sr) entsprechend andert. Sei z.B. im extremen Fall die Bewegung 
parallel einer bestimmten Geraden Gr unmtiglich, so kann man das ausdrticken, 
indem man setzt: 

lim Vr (Sr) dsr “ 1 ftir den Punkt Q (oder den entsprechenden Punkt Pr~i), 

Vr (fir) “ 0 ftir alle anderen Punkte. 

Aus der Verteilung 4>i (P) ergibt sich nun eine weitere, indem von alien 
Flachenelementen df mit einem inneren Punkt Q {Q) df Massenpunkte in 

* Diasertation zor Erla.ngiing der Doktorwiirde der Mathematisoh-Naturwisaenachafiliohen Fakultat 
der Hamburgieohen Univeraltkt. 



192 


Ueber Verteilmigen von Ma^senpunJcten 

derselben Weise ausgeheti wie vorher die N Massenpunkte von Q aus. Die 
Verteilungsdichte der <I>i*(0) c^Massenpunkte um Q ist also 

dabei ist ^i(QP) gleich ^i(P) bezogen aiif $ statt Q als Anfangspunkt. Die 
Ueberlagerung aller dieser Verteilungen ergibt dann 

4>, (P) = ^ (QF) 4>i (Q) df. 

Allgemein erhalt man die n-te Verteilung durch die Rekursionsformel 

(P) = \ (QP) (Q) df, 

wobei die Integrale beidemal liber die ganze Ebene zu erstrecken sind. 

Niujh Ldsung dieses Qrundproblems soil dann die Verteilung berechnet werden, 
wenn sich von alien Punk ten einer bcgrenzten Fl^he mit der Dichte N die 
Massenpunkte in der obigen Weise liber die Ebene ausbreiten, wobei mit Rlick- 
sicht auf die Anwendungen einige Spezialfalle in bezug auf die Art der Begrenzung 
naher ausgefuhrt werden sollen. 

Ueber die biologischc Bedeutung der folgenden Ausflihrungen mochte ich an 
dieser Stelle noch einige Erlauterungen geben. Es handelt sich in biologischer 
Hinsicht bei der vorliegenden Arbeit im wesentlichen um folgende zwei hauptsach- 
lich interessierende Fragen: 

1. Wie viele Moskitos befinden sich durch schnittlich pro Fl&cheneinheit in 
verschiedenen Entfernungen von einem stark Moskito-verseuchten Gebiet, wenn 
nur in diesem eine Vermehrungsindglichkeit fUr jene vorhanden ist? Dabei mdge 
die Ausbreitung der Insekten nur durch zuialliges Hin- und Herfliegen zustande 
komincn. Von irgend welchen grosseren bewussten oder absichtlichen “Zielflligen ” 
und von ausseren Hemmnissen sei also zunachst abgesehen. Hicr angenommene 
Verbal tnisse liegen z.B. annahernd in dem Steppengelandc des Wolgagebietes 
vor. Dort grenzen an die Sumpfniederungen der Wolga, die stark mit Moskitos 
bevblkert sind, verbal tnismassig grosse Steppengebiete, die den Moskitos eigentlich 
nur in der Nahe menschlicher Siedlungen, durch die dort vorhandenen Tiertranken 
oder Wasserlachen, eine Brutmoglichkeit bieten. Werden diese nun desinfiziert 
(z.B. durch Bestreiien mit Arsenstaub), so kann eine Malaria-Uebertragung nur 
durch die aAUS den Wolganiederungen eingestreuten Moskitos zustande kommen. 
FUr die Bekampfung der Malaria ist es in einem solchen Fall daher ausserordent> 
lich rilitzlich zu wissen, wie weit eine betrachtliche Anzahl dieser Moskitos in das 
Stoppengelande eindringt. Ein beliebig weites Eindringen ist des allmahlichen 
Absterbens wegen nicht mbglich, da ausserhalb des FlussgUrtels wohl Gelegenheit 
zur Eiablage, nicht aber, ofler in verschwindend geringem Masse, Mdglichkeiten zur 
Vollendung der Entwicklung vorhanden sind. Auf die Wichtigkeit dieser Frage 
flir die Malariabekampfung wurde bereits von Sir Ronald Ross hingewiesen, der 
auch schon eine Verteilungsfunktion zu berechnen versuchte, worauf sp&ter noch 
zurUckzukommen sein wild. 
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diesem Zusaramenhang mdchte ich darauf hinweisen, daas die Ausfulurungen 
dieanr Arbeit sich nicht auf solche Tierarten beziehen, die bei ihren '^Flflgen'* 
infolge irgendeines Instinkts absichtlich — wie z,B. die Bietien bei ihren t&glichen 
Fltlgen zur Honigsuche — an den Ansgangspunkt zurUckkehren, sondern ein Zurtiok- 
konunen an den Ursprung eei reiner Zufall. In obigem Fall ist z.B. wegen der 
Mdglichkeit der Eiablage in der Steppe ein regelmassiges ZurUckdiegeu der 
Moskitoe an die Brutplatze des Flussgebietes selbst dann nicht wahrscheinlioh 
wenn ftlr jene ein sogenannter ** Heimatsinn anzunehmen ware, wobei ala Heimat 
im Fall der Moskitos Qegenden mit Bratmbglichkeiten anzuaehen w&ren. Die 
Mdglichkeit des VorhandenBeins eines ** Heimatsinns ’’ ist im Ubrigen bei den 
meisten Moskitoarten noch durchaus ungeklftrt. 

2. Wie wird sioh die Ausbreitung (Infiltration) einer Tierart in ein zun&chst 
unbesiedeltes Qebiet, das an ein besiedeltes grenzt, abspielen? Dabei ist wieder 
die wichtigsfce Voraussetzung, dass die Ausbreitung nur durch zufklliges Hin- und 
Herfliegen erfolgt. Weiter sei angenommen, dass alle Punkte der Ebene als 
“Pl&tze mit Vermehrungsmttglichkeiten” oder als “habitats** (in dem Sinne der 
noch zu besprechenden Arbeit von Herrn Prof. K. Pearson) mdglioh sind oder 
zumindest, dass die Entfernungen der einzelnen “ habitats ** sehr klein sind gegen- 
iiber der durchschnittlichen Lange eines Fluges. Dieses Problem ist natiirlich nur 
als ein dynamisches zu betrachten, denn der station&re Endzustand wird in dem hier 
angenommenen einfachsten Fall der ortsunabhilogigen Vermehrungsmdglichkeit, 
cSrtlich konstant sein. Es kann sich also nur darum handeln, den zeitlichen Verlauf 
einer oben beschriebenen Besiedlung eines zun&chst leeren Qebietes darzustellen, 
und zwar, wie gesagt, eines Gebietes, das an fast alien Punkten Siedlungsmdglich- 
keiten bietet. Auf den Fall, dass solche Siedlungsmdglichkeiten nur an verein- 
zelten Stellen, deren Entfernungen in die Grdssenordnung einer Fluglftnge oder 
mehr kommen, vorhanden sind, sind also die folgenden Ergebnisse nicht oder 
hdchstens als erste grobe Ann&herung anwendbar. 

1. V orausaetzungen und Ergehniaae der dlt^en Arbeiten. (^Ronald Rosa, Lord 
Rayleigh^ Karl Pearaon,) 

In diesen Arbeiten handelt es sich um ein ahnliches wie das in der Einleitung 
diskutierte Problem. Die wesentlichen biologischen Voraussetzungen, wie sie in 
der Arbeit von Herrn Professor Pearson angegeben wurden (s, Lit,-Verz. Nr. <S), S. 3 
und 4), seien im folgenden kurz dargestellt. 

Ueber eine geniigend grosse Fl&che seien gleichfbrmig sogen. “habitats” — 
d.h. ftlr Mlicken : Platze mit Brut- und Lebensmdglichkeiten — verteilt. Die 
durchschnittliche Flugl&nge I einer MUckenart von einem “habitat** zu einem 
neuen m5ge als “Bight” bezeichnet werden. Diese “Bights** sind zu unterscheiden 
von den “flitters,” die ein blosses Hin- und Herfliegen in der Nachbarschaft der 
“habitats** zwecks Nahrungssuche etc. bedeuten. 

Die Frage des Grundproblems lautet dann: Wie gross ist die Anzahl der 
MUcken pro Fl&cheneinheit — also die Diohte ^ii(P) — an einem Punkt P in der 
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Entfemung r von Q, wenn von Q ausgehend N Miicken n ‘‘flights” der L4nge I 
(=» durchschnittlicher Abstand benachbarter “habitats”) ausftihren? Dabei soil es 
sich um sogen. “Random Migration” handeln, d.h. die Verteilung soil durch 
zufalliges Hin- und Herfliegen entstehen. Die einzelnen Flugstrecken (von einem 
“habitat” zum anderen) seien jedoch gradlinig zuriickgelegt und wie beim ersten 
Flug seien von einem Ausgangspunkt aus alle Richtungen gleich wahrsoheinlich. 
Beim ersten Fluge verteilen sich also N Miicken in erster Naherung gleichmassig 
um Q auf einen schmalen Kreisring mit dem Radius L Beim zweiten Flug 
wiederholt sich das von jedem Flilchenelement des ersten Kreisringes, und die 
Verteilungen auf den nun entstehenden Kreisringen Uberlagern sich und ergeben 
die Verteilung nach dem zweiten Flug. In analoger Weise erh&lt man alle weiteren 
Verteilungen. 

Sir Ronald Ross fiihrt die Frage auf ein lineares Problem zurtick. Er macht 
nnscheinend die Annahme (das ist in seiner Arbeit jedoch nicht genau angegeben), 
dass nach einem Fluge von Q aus — bei Projektion der einzelnen Fliige auf die 
A-Achse — 4 JV der Miicken in Q geblieben sind, wahrend je JiV" sich im Abstand 
I links bzw. rechts von Q befinden. Von diesen drei Punkten wiederholt sich jetzt 
dieser Vorgang, nur dass statt jetzt bzw. Miicken sich verteilen, so dass 
nach dem zweiten Fluge in Q : -nr-^ rechts und links von Q im Abstand 
je und im Abstand jo Mucken sich befinden. Das gibt dann nach 
n Fliigen die bekannte Verteilung der Binomialkoeffizienten von 2*" multi pliziert 
mit Es befinden sich also nach n Fliigen im Abstand kl von Q insgosammt 




fUr A; > 0 
fur k = 0 


(I). 


Wie aus der Wahrscheinlichkeitsrechnung bokannt gilt nun fur grosse n 
und n ^ k 


/ 2n v 2.y^ 2N 
\w — k) 2®” V nTT ^ 


( 11 ). 


Das muss also, da in Wirklichkeit eine kontinuierliche Verteilung besteht, die 
Anzahl Mucken sein, die sich nach dem n-ten Fluge auf einem Kreisring der 
Breite /, der den Kreis mit dem Radius kl umgibt, befinden, Bedeutet also <I>u(r) 
die Dichte im Abstand r von Q, so muss sein: 

2N r2«- ,•(*+«)? 

“ 7 ===- c w = 1 <Pn{r)rdrdd> wobei 0 < 5 < 1 

VWTT Jo 

((k + 6)l 

as 27r I ^n(^)^dr 

rik+B)l 

= 27r4>n (Ik + €] 0 rdr wobei S — 1 « € ^ S 




(HI). 
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also muss die Diohte fUr ein r, dass zwischen (A; — 1) 2 und (A; + 1) A liegt, sein 

Ne n 


4>n([* + 6] 


(A? + S — J) TT '/jm 


(IV). 


Diese Beduktion des zweidimensionalen Problems auf ein eindimensionales ist 
jedoch wohl reichlich willkllrlich, und wieso dieses Resultat mit den spateren von 
Lord Rayleigh (V) und K, Pearson (VI) fUr grosse n iibereinstimmt, wie Ronald 
Ross behauptet, ist wohl kaum einzusehen, denn jene erhalten ftlr grosse n: 


(r) 


N 

nl^ir 


e nP, 


also ([*+€] A) nP . 

Da nun sowohl S wie e nur zwischen — 1 und + 1 variieren kbnnen, geht dieser 
letzte Ausdruck rnit n-^oo etwa wie 1 :n gegen 0, wahrend (IV) nur wie 1 \s/n 
gegen 0 geht. Der absolute Unterschied zwischen diesen beiden Resultaten ist 
allerdings fiir grosse n nur klein, da ja beide gegen 0 gehen ; relativ unterscheiden 

sie sich jedoch der Grdssenordnung nach etwa um den Faktor mit 

wachsendem n also beliebig gross werden kann, und nur bei k ^ \J\. 
ziemliche Uebereinstiminung zu erwarten. 

Etwas naher kommt der gestellten Aufgabe das von K. Pearson zitierte Resultat 
einiger Arbeiten von Lord Rayleigh liber akustische Probleme (s. Lit.- Verz. Nr. (2)). 
Hierbei sind zwei Richtungen {x- und y-Achse) fiir den von einem Punkt aus- 
gehenden gradlinigen Flug I als gleich wahrscheinlich angenommen. Ebenfalls 
sollen die Wahrscheinlichkeiten fiir einen Flug nach der positiven und negativen 
Seite der a?- bzw. y-Achse hin gleich sein. Fiir grosse N sind also folgende Verteil- 
ungen zu erwarten : 



18.-3 
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Nach dem ersteii Flug befinden sich in den 4 Punkten (x), siehe S. 195 : 

^== 0 , y^±l\ sc^±l, y = 0 , 
jc -AT: 4 und in alien andern Punkten der Ebene 0 x -AT Mtickeii, 

Von diesen vier Punkten verteilen sich nun die iV: 4 wie vorher die N Miicken 
von Q aus, so dass nach dem zweiten Fluge sich in den Punkten (x): 

a? = 0, 3 / = 0 iV' : 4, 

^ « 0 , \ a;=± 2 ^, 3 / = 0 ... je iV": 16, 

y=±l... ... ... ... jei\r :8 

Miicken befinden, wie man leicht aus obiger Skizze entnimmt. Analog erhalt man 
alle weiteren Verteilungen. Wenn n -^00 geht und gleiohzeitig I gegen 0, so dass 
endlich bleibt, ergibt sich eine Verteilung mit der Dichte 



die jedoch nur fur grosse 7i und r® -< nl^ angenahert gilt. 

Die exakte Lbsung dieses allgemeinen Problems — also die der Verteilung bei 
Annahme einer konstanten Fluglange mit Gleichwahrscheinlichkeit fur alle Richt- 
ungen — hat darm Professor Karl Pearson 1906 gegeben (s. Lit.-Verz. Nr. ( 8 )). 
Vorher hatte I. C. Kluyver einen allgemeinen Ansatz mit Hilfe Besselscher 
Funktionen aufgestellt, der nicht zura Ziele fiihrte, jedoch die Lbsung einiger 
spezieller Problenie und den Bcweis fur die Richtigkeit der Rayleigh ’schen Lbsung 
fiir grosse n erbrachte (s. Lit.-Verz. Nr. (4)). K. Pearson erhielt als Lbsung ebenfalls 
ein Integral fiber ein Produkt Besselscher Funktionen (naheres fiber Besselsche 
Funktionen s. Lit.-Verz. Nr. (5), ( 6 )), namlich : 

«*>» (r) = i»r) {Jo (ul)}-du (VI), 

was sich durch geeignetc Umformung in folgendc Reiho entwickeln lasst : 

(^) = A’(roG >0 + V20y2 “h + ... ) (VII), 

a$. (I <vin). 

«72 

“it >3 = -^ , (IX). 

( eo2s^n(^) 

und !/», = (- 1)* — (X). 

Jo 

Die bestimmton Integrale in diesem Ausdruck lassen sich auswerten, man erhalt 
z.B. fiir 

"“” 1 ' = 9^** 

6 n - 11 50« - 57 

"• 12 »* ’ 1800n* ■ ■ ■• 
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Die Konvergeius der Reihe der o»-Funktionen ist nun fdr n< 7 aehr schlecht. 
FUr dieeen Fall haben K. Pearson und J. Blakeman die Yerteilungafunktion auf 
graphischem Wege gefunden. Fiir grosse n geht 

^n(r) (Va), 

gegen Nwq, das ist die Rayleigh ’sche Losung. 

Den bisher besprochenen Methoden lag, wie gesagt, die Annahme einer kon- 
stanten Flugl&nge zugrunde. Vielfach wird in der Natur diese Annahme nicht 
berechtigt sein, sondem haufig werden sioh schon nach der ersten Verteilung von 
einem Zentrum aus in alien Entfemungen die entsprechenden Lebewesen vorfinden, 
vorauBgesetzt dass kein Punkt durch besooders gilnstige oder ungUnstige Lebens- 
bedingungen ausgezeichuet ist. In diesem Fall ist es nun durchaus nicht evident, 
dass nach n Fltlgen dieselbe Verteilung resultiert, wenn man nach einem Fluge 
von einem Zentrum aus annimmt: (1) Alle Mtlcken beiinden sich auf einem 
schmalen Kreisring mit dem Radius I, oder (2) wenn mann annimmt, irgendeine 
Verteilung mit derselben durchschnittlichen Fluglange I erstreckt sich Uber die 
ganze umliegende Fl&che. Wahrend beim ersten Fall die Verteilungen nur von 
Flacheneleraenten des Kreisringes ausgehen, die sich dann iiberlagem, findet 
dieser Vorgang in Wirklichkeit vielleicht von alien Flachenelementen einer 
ganzen Umgebung des Zentrums statt, und es ist nicht gesagt, dass die Verteil- 
ungen ausgehend von Fl&chenelementen ausserhalb bzw. innerhalb des Kreises 
mit dem Radius I sich derart superponieren, als ob sie alle von den Flachen- 
elementen des Kreisringes mit dem Radius der durchschnittlichen Flugl&nge I 
ausgingen. 

2. Die Grwndverteilung nach dem ersten Fluge um ein gegehenes Zentrum hei 
variabler Fluglange, 

In dieser Arbeit soli mit moglichst wenigen und plausiblen Voraussetzungen 
die Verteilungsfunktion bei variabler Fluglange gefunden werden. Ich gehe dabei 
gem&SB der in der Einleitung ausgefiihrten Problemstellung von folgenden Voraus- 
setzungeri aus, um die Verteilung nach dem ersten Fluge zu erhalten : 

1. Es seien m versehiedene Richtungen (wobei nachdem m-*^oo) moglich, in 
die eine Mtlcke fliegt. Die Richtungen seien dargestellt durch m Geraden durch 
den Ursprung. 

2, Die Flugl&ngen einer bestimmten Richtung Gr unterliegen einem Wahr- 
scheinlichkeitsgesetz Vr(«r)** 

3* W&hrend des Fluges wechselt die Mtlcke m-mal ihre Richtung und durch- 
fiiegt in der jeweiligen Richtung eine gem&9S Vr (sr) zu erwartende Strecke. 

A Die Wahrscheinlichkeitsdichten t^r(^r) auf den Geraden Or sind ausserhalb 
eines Kreises mit dem Radius K um den Ursprung Null. 


V («,.) braucht im einzelnen nioht bekannt za sein. 
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6. Die Vr («r) mogen fUr alle r einer Lipschitzbedingung genilgen, d.h. die zu 
den einzelnen Vr(fir) gehorigen Zahlen Mr besitzen eine obere Schranke M\ es gilt 
also flir beliebige Punkte der Qeraden Qr' 

Sr\. — 

Die letzten beiden Voraussetzungen erwahne ich nur, da sie beim Beweis 
benutzt werden, wahrend sie filr die eigentliche Problemstellung unwesentlich 
sind und nur die Art der sonst beliobigen Funkbionen VriSr) etwas einschranken. 

Unser Problem ist dann identisch mit der Frage nach der Wahrscheinlichkeits- 
dichte eines Punktes P (des Endpunktes des von Q aus abgetragenen 

Vektors QJP), den ich erhalte, wenn ich bilde : 

QP\ + QP 2 4 - QP 8 + . . . + QP m == f 

wobei die Punkte P auf den Geraden Or gcmass Vr(s.r) variieren. 

Fast dasselbe Problem ist nun fur den Ar-dimensionalen Raum von H. Pollaczek- 
Geiringer (s. Dit.-Verz, Nr. (7)) behandelt worden, nur variieren dort die Punkte Pr 
auf goschlossenen Kurven Or statt wie bei uns auf den Geraden Or. Fiir den 
Beweis ist davon aber nur von Bedeutung, dass man durch die Annahme geschlos- 
sener Kurven in einem endlichen Gebiet bleibt, was in unserm Fall durch die 
zusatzliche Festsetzung 4 (S. 197) t;y(sr) = 0 ausserhalb des Kreises (d.h. nichts 
anderes als dass es fur die Dangen bei gradlinigem Flug eine obere Schranke K 
gibt) gewahrleistet ist. Ausserdem kbnnte man sich die Geraden Or ausserhalb 
des Kreises IC in geschlossene Kurven tlbergehend denken, ohne an dem Problem 
etwas zu andern. 

Ich mochte mich deshalb hier auf den prinzipiellen Gang der Herleitung 
beschranken. Einzelheiten konnen aus der zitierten Arbeit entnommen werden. 
Gefragt ist zunachst nach der Wahrscheinlichkeit, dass die Vektorsumme 

2/1 (^i) 4“ 2/2 (^ 2 ) 4- ... 4“ 2/m («w») 

zwischen y und y 4- d'^ liegt, wenn der Endpunkt von yr (fir) auf Or gem&ss der 


Wahrscheinlichkeitsdichte Vr(fir) variiert. 

Wenn + (1), 

ist die Wahrscheinlichkeit dafiir, dass gilt 

m 

V m 4- < 2 yr (fir) < V W (w 4- du) .... (2) 

r=l 

gegeben durch 

^m(«)»=/ fvi(si)va(sa)... (®wi.) d8xdtS% ■ • • dSf^ .(3), 

J (f») J 


das m-fache Integral erstreckt tlber den Teil eines m-dimensionalen Wtlrfels in 
dem (2) gilt. 
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Mon setzt nun 

yx («l) + yi («•)+... + ym («m) -» 4* 6m, (4»), 

woduroh die neue Veranderliche u auf 6m (Mittelwert) als Koordinatenursprung 
bezogen ist und der Masestab wie 1 : tjm verkleinert ist. Das besagt dasselbe, wie 
wenn ich im alten Massatab annehme, dass statt eines Fluges bestehend aus m 
ElementarfitLgen einer bestimmten Eiohtung Or niit der resultierenden Verteilung 
Vr («r), in diese Richtung nur einer dieser ElementarflUge ausgefUhrt wird, fUr den 
die Streuungskomponenten sind : m und dessen Mittelwert ist : fjm mit den 
Komponenten art : nnd (s. (6) und (9)). Die durch (4) debnierten 

zwei Gleich ungen lassen sich nach und Sm, wenn die entsprechende Funk- 
tioualdeterminante D 0 ist, aufldsen. Die Werte fUr «m-i nnd «m kann ich in (8) 
einsetzen und erhalte : 


““ J" * . . j* (^l) . . . a) Vm—l (^m—l [^3) 

X g j dsx . . . d8fn-%dutdu%^ (3a) 

erstreckt Uber das m-dimensionale der Transformation entsprechende Integrations- 
gebiet. Dabei ist die Integration nach duj, und du% gerade iiber das Interval I 
duxdu 2 erstreckt. Wenn 

(u + du) s/m + 6„» in u s/m 4- 

Ubergeht, kommen wir von der Wahrscheinlichkeit J^n (w) zur Wahrscheinlichkeits- 
dichte Wfn (u), indem wir bilden : 

lim ^ - = w^n, (u) 
fu^^oduidu2 


dut 


= 1 • . • («l) • • • («».-*) »m-l («m-l [mJ) Vm (»m [“]) 

J (m - 2) J 

- 1 <i»i ... dsm-t (5). 


Dieses (m — 2)-fache Integral multipliziert mit iV" geht nun fiir grosse m in unser 
gesuchtes <bi liber. Es ist also lim Wm (u) zu untersuchen. 

m^ao 

Es sei nun noch zu den Vr (sr) ein Mittelwert dehniert als : 

= Jyr »,(«,) d«rt (6). 

m 

und es sei : 2 «* 6^ (7), 

r«l 

und als "Streuung” die zweireihige symmetrische Matrix 

SI) 


■2, 

(n~fl 

/-/n- /.-/r- /•-/: 


* und hangen natilrlioh auch nooh yon bis ab. 
t Es soil im folgenden heissen: 



200 


Ueher Verteilungen von Massenpunkten 


wobei « J Vr {yKr - «Ar) (y^r - a^r) dSr (9), 

und ea aei : 1 < a, < 8 (10), 

2 ( 11 ), 

also : ni und lim =* hj^ (12). 

m-*-oo 


Die seien endlich, sie biJden die positive definite Matrix der reduzierten 
Streuungen. Die Siimme der Streuungskomponenten der Vr(8r) wachse also wie 
m ins Unendliche, d.h. die Streuungskomponenten der verschiedenen Bichtungen 
seien fast alle von der gleichen Grossenordnung. 


Zu Vr sei eine komplexe Adjunkte 



/,(u)=j e' '“-V (Vr (O-Orl Vr{t)dt* 

(13) 

definiert, sodass 

/r(«r)=l 

(14). 


(ll-» 

(15). 


(a«AaO«r” 

(16). 

Analog sei die komplexe 

Adjunkte von 



Pm ('W') — i^Z) dZ\ dZ2 

(17). 

Fuhre ich nun statt S 2 die alten Variablen cin, namlich 



X.-1,2 

(18). 


so wird unter BerUcksichtigung von (6), 


/ \ f f rk i Jm ^ (Pir + 2 (2/2r“<^)f 

Pm(«) = J I Vx(Si)...Vm{Sm)d8i...ds„ 



i 




Vi (si) dsi 



^ Vm (Sm) dSn 





(19), 


d.h. die komplexe Adjunkte von Wfn{u) (ein Integralprodukt) ist auf ein gewohn- 


* ah bedeutet skalares Produkt der beiden Yektoren a und h. Also + wobei a,, 

6i, die Komponenten von a bzw. h aind. In obigem Produkt entsprioht: a dem Vektor (u-a,.) und 
h dem Vektor (t) - <v). 
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liches Frodakt der komplexen Adjunkten der einzelnen Veiteilungeo Vr 

zurUokgeiUhrt. Nach einem Satz Uber gewdhnliche Produkte von v. Mises (a. Lit.- 
Verz. Nr. (S)) gilt nun unter gewissen Voraussetzungen, die hier zutreffen, wenn 
ftlr die Funktionen fr (14) bis (16) gilt : 

= lira Pm (it) 

W ”*‘00 

g *” i (^11 “i* + ^**2*)^ (20), 

berechnet sich aus den sj^ nach (11), (12); ausserdem gilt: 

lim jjpm(u)y/r(u)duidua^jje '^(u)duidu 2 (21). 

Mit Hilfe des Fourier'schen Integraltheorems l&sst sich nun umgekehrt aus der 
Adjunkte einer Verteilung diese selbst wiedergewinnen und zwar ist: 

Wm (u) =* jje^^prn {z) dz^dz^ (22), 

wobei Pm die Adjunkte von Wm ist. 

Also bei Benutzung der vorigen Formel (21) mit — wird : 

jje-^pm(z)dzidzt 


(r)7J- 


e ^ dztdzt (23). 


Das Doppelintegral laast sich auswerten durch zweimalige Anwendung der Formel 


wobei ich erhalte : 


i 





lim «;„(«) = w(m) = 

m-^oo 


1 ^ 

2w y/ fill fha — 


h^Uj^ - 2^a Uj «g + V 

V) 


(24), 

(26). 


Das wgre also die Wahrscheinlichkeitsdichte fiir den Punkt P, den Elndpunkt 
des Vektors u y/m + b, den man durch die angegebene Vektoraddition erhalt. Oder 
bei der oben auf Seite 198 gemachten Annahme, die ja im wesentlichen nichts 
anderes besagt, als dass die L&nge einer gradlinigen Flugstrecke umgekehrt pro- 
portional ist der Anzahl der dberhaupt ausgefUhrten gradlinigen Flugstrecken, ist 

Nw (w) « 4>i (tt) (26). 

Deus ist die Dichte der Mtlcken — also Anzahl pro Flacheneinheit — nach dem ersten 
Flug an einem Punkt P dein Fndpunkt des von Q abgetragenen Vektors u, (Dabei 



202 


Usher Verteilungen ran Massenpmikten 


ist lim = 0 angenommen, andemfalls ist dio gesammte Verteilung um der 

Vektor b zu verschieben.) 

Als Ldsung de8 Grundproblems erhaltcn wir also eine Gauss’sche Verteilung 




N 


. 0 2 (hji hfiQ — ^la*) 


27r h>xxh 22 — 

Durch geeignete Wahl des Koordinatensystem, Drehung um den Winkel 

i 1 . 2Ai2 

<f>=inrctg. , , 

All — n22 

mittels orthogonaler Transformation der Koordinaten, indem man setzt : 

Ux = w cos — y sin - 1^2 = w sin 4* y cos <l>, 

Itisst sich (27) noch vereinfachcn, und man erhalt : 


.(27). 


dabei ist 
und 


sx 


2 (bxih22 — ^' 12 ^) 


.(28), 


^11 + ^#22 4" V4'/>i 2* 4- (/f'22 — ^11)^ 
2 _ 2 (hxxb22 ^12^) 


fi2" = 


bxx 4- ^22 — 4/ti2* 4- (/f 22 •“ ^11)* 

Sind nun keine Teilo der Ebene, liber die sich die Verteilung erstreckt, irgendwie 
ausgezeichnet, was ich hier der Einfachheit halber annehmen mochte, so muss aufi 
Symmetriegriinden * ^ 2 ^ sein, und es wird : 


N - — 


( 7-2 — ^2 ^ y2>^ 


N 


I e 


.(29). 


2s^ sn (T^ kann man als “Streuung” des Abstandes r vom Ursprung bezeichnen, 
wenn man als Definition der Streuung ansieht : 

Oder bei geometrischer Verteilung: 

U 2»r 1 _ Z! Iff - 

1 ff-,?'! 1 f ) 


= S» + s* = 2«® = 0 -* 


.(80), 
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da allgemein gilt (s. Formel (24)) : 


V2 


If-®* 1 r ^ 

==- («*« a»* d® = «*, -7===“ I « *•’ 
STT^J V27r«J 


Analog ist der mittlere oder durchschnittliche Abstand von Q : 

^ ” :^/o JT ^ 27r J^r*e " a?dr - | V«r .. .(31). 

Das ware wohl die Grdsse, die in den friiheren Arbeiten als durchschnittliche 
Flugltoge I bezeichnet wird, obgleich es genauer heissen miisste : durchschnitt- 
licher Abstand nach einom Fluge von einem Punkt aus, denn die im Durchschnitt 
tatsachlich durchdogene Strecke ist zweifellos grosser. 


3. Die von einem Punkt auagehende Verteilung nach dem n-ten Fluge, 

Nach dem ersten Flug befinden sich auf einem Flftchenelement dx^dtf mit 
— N 

dem inneren Punkt dafdy* Mlicken. Beim nachsten Flug verteilen 

sich nun diese von Q aus wie vorher die Qesamtzahl N von Q aus. Der entsprech- 
ende Vorgang findet von sEmtlichen Fl^henelementen der Fbene statt. Alle diese 
Verteilungen tiberlagern sich und ergeben die Gesamtverteilung nach dem zweiten 
Flug. Entsprechend erhalt man alle weiteren Verteilungen. Eis sei nun 

r2 


<I>n-i (r) = 




— 11 or* 


TT (n — 1) or’ 


.(82), 


dann ergibt sich nach der Methode der vollst&ndigen Induktion, da (32) fUr « 2 

richtig ist, namlich 


<I>i (r) . 


N ■ 


.(29), 


«>«(r)- Jj: 


N 

:rr — ^e {n - 1 ) dx^ dv* 

TT (n — 1) ^ tto-' 


1 - 


(r - r')® 


N 


7r*(w — 1) cr* 

N 


r« 


//' 

Sjj, 


nr^ ~2(n-l)yV 

(n - 1) IT* dx*d}/ 


3(n-l)ara:' + nar'a 
(n-l)«ra 


/ 


2 (n ~ 1) yy' + 
e (n - 1) o-a 


7r*(n — 1) 

nach zweimaliger Anwendung der Formel (24) erhalt man : 
N 


d,y'| dx\ 


ir*(n-l)<7* 


N -jm. N 

: ^ IKT* — e 4nZ“ 

riTra* 4n4* 


4>„(r) 


(38). 
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Die Rayleigh-Pearson'sche LOsung war fiir n ■ 

N 




nirl^ 




.(V). 


Sie weicht also fUr grosse n nur in geringeni Masse von der hier erhaltenen ab 
and wiirde rnit dieser zusammenfallen, wenn mit die “Streuung” nach dem 
ersten Flug gemeint ware. 

In einern Kreis iim Q mit dem Radius cr befinden sich nach n Fliigen : 

ctr f^TT jyr „ r ... ^ 

*'•<'>■1,1 w* [-« ’“-j. 

.n{i < 31 ). 

Das stiinint in erster Naherung mit dem Resultat von Kluyver (l.c. S. 344) bzw. 
Pearson liberein, wenn wie oben mit der iiiittleren Fluglange I das a gemeint ist. 
8ie erhielteii : 

(<r) = : (H + 1 ) = (^ - J, + + . . .) (34 a). 

Die Verteilung nach m Flugperioden, deren jede aus n Fltigen besteht, ergibt 
sich ohne weiteres aus (29), indem man setzt : 

rio** statt und m statt n, 

Es wird dann nach (33) 

(^) ” (^) (^^)* 

Also resultiert die gleiche Verteilung, ob m-mal Flugperioden aus je n Fliigen 
stattfinden oder ob eine Flugperiode aus mn Fliigen stattfindet. 

Ist nun von einer Miickenart Mx das bekannt, so lasst sich daraus fiir eine 
anderc berechnen, wenn man auch fiir diese nach dem sogenannten Prinzip 

von “Trial and Error*’ in der hier ausgefuhrten Weise die Verteilung, durch 
“alltagliche Fliige ” (s. Lit.-Verz. Nr. ( 17 ), S. 237) entstehend, annehraen kann, 
dass also nach der in der Biologic Ublichen Ausdrucksweise “ Random Migration ” 
vorliegt. Zunachst sei die Dauer eines Fluges von ^-mal so gross wie die von Mx, 
Dann kann man die Verteilung von nach einer Flugperiode aufTassen als die 
Verteilung von Mx nach t Flugperioden. Oder man nimmt an, dass bei ^-facher 
Flugdauer statt eines Eiomentarfluges einer bestimmten Richtung Or t Elementar- 
flUge dieser Art ausgefiihrt werden, wotiurch die Streuungskomponenten mit t 
zu multiplizieren sind uud daher auch, wie man aus Formel (8) bis (12), (28) und 
(30) sieht: 

<t2* = W (36). 

Also sowohl nach (29) in Verbindung mit (33) wie nach (35) erhalt man: 

‘*>4>n(r)-<^)^en(r) (37). 

Andererseits sei die lineare Fluggeschwindigkeit von A?*mal so gross wie die 
von Mx- Dann wird man annehmen kdnncn, dass die Wahrscheinlichkeit fUr 



HBNEY MlHIiMANN 


205 


die Lftnge einee Fluges l&ngs Gr bei Mx gleich ist der fUr die A;-&che L&nge bei 
M% Itogps derselben Qeraden Or, Man hat also in (6) bis (9) fUr yr kyr zu setzen. 
Daraus folgt, dass sjjj ttbergeht, und nach Beriicksichtigung der gleichen 

Formeln wie oben wird also 

(38). 

Unterscheidet sich also von Mx durch die ^>faohe Flugdauer und die A?-fache 
Fluggeschwindigkeit, so erhalt man die Verteilung fUr aus der ftir Mx als : 

(39). 

Umgekehrt lasst sich von den Verfceiliingen zweier Tierarfcen auf das Ver- 
bal tnis ihrer FJuggeschwindigkeiten schliessen, eine Grdsse, die bei Untersuchungen 
von A. J. Lotka liber das Fangen und Entkommen der Beute eines Raubtieres von 
grundlegender Bedeutung sein dttrfte (s. Lit.-Verz. Nr. (21)). 

4. Verteilung von ein&in mit der Dickie iV hesetzien Gehiet in die unheaetzte 
Umgehung^, 

Es gehe jetzt die Verteilung von einer irgendwie begrenzten und gleichmassig 
dicht besetzten Flache Q aus. Die Dichte sei N, Es befinden sich also in einem 
Flachenelement dx'dy* Ndx*dy^ MUcken. Die Verteilung von jedem Flachen- 
element aus sei gleichmassig nach alien Sei ten wie in den ^ 2 und 3. Dann wird 
nacb n Flligen die Dichte an einem Punkt P ~ (a?, y) sein : 

fn (a> y) — Ndm'dy' ^ <E>„ + (y ~y‘^) 

= f( ^df. (40). 

JJ IQ) 

Das Integral ist tlber die Flache Q zu erstrecken. 

Sei andererseits fiir das Aussere eines Gebietes Q die Dichte gleich iY, so ergibt 
sich flir die Verteilung in das ursprtlnglich freie Innere nach dem n-ten Flug : 

(«, y) = “ jj-d-df — df ~ N — y) (41 ), 

denn das Integral Uber das Aussere einer Flache Q ist gleich dem Integral tlber 
die ganze Ebene vermindert um das Integral liber das Innere von Q. 

Beiepiele f(lr versohiedene Gebiete, 

(a) Verteilung von einer durch eine gerade Linie L begrenzten Halbebene aus. 

Die Dichte rechts von L sei JV. Wie gross ist nach n Flugen die Dichte F im 
Punkt O (Koordinatenanfangspunkt) mit dem Abstand — ^ von Zl 

* Tdilweiae Bind die mathematisohen Formeln dieees Paragrapben im Prinzip in der PearBon’sohen 
Arbeit (a. Ijit.>Verz. Nr. (8)) Bohon enthalten, da eie aus den dortigen komplizierteren Satzen jedooh nioht 
BO ohne weiteres ersiobtlioh aind und iob haufig eine etwas andere Ableitung gewfihlt babe, babe ioh 
nkhere Hinweise unterlassen. 
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Nach (40), wenn die y-Aohse parallel X, ist 

{co) = F„ (0, 0) = I «!>„ dx'dy' 

= ^ffe 

7l7ra^J a; J 


\e ' wir* dfv'dy' 


j\r r f r ) 

= «w-sLr ”'“r ”-^dy'jd.' 


n'rraJ x 


JnTTCFJ X 

-E=r- 

Jirna 


_ 

e n<r^ dx' 


1 


iv|l - J_— re""ir«d4 (42), 


d.i. also die Dichte an einem Punkt mit dera Abstand — x von X. Die Werte des 
Integrals sind aus Tabellen, wie sie fast in jedem Werk iiber Wahrscheinlichkeits- 
rechnung oder Statistik (z.B. Lit.-Verz. Nr. («) bis(ii)) enthalten sind, zu entnehmen. 
Fiir £c==0, d.h. auf der Grenzlinie X ist die Dichte wie zu erwarten gleich ^JY'. 

(6) Verteilung von einem Rechteck mit den Seiten 2a und 26. 

Die Dichte sei wieder N, Die Koordinatenachsen sein parallel den Rechtcck- 
seiten und ihr Anfangspunkt sei der Schnittpunkt der Diagonalen. Dann ist die 
Dichte nach dem n-ten Fliig im Punkt (/*?, y ) : 

(®, y) “ f f ‘3><. - «)* + (y' - y)*) dx'dy’ 

J —a J — & 

fa f* ( a?^-a;)g + (j/^-?/)« 

; \ e dx'dy* 

J J - b 


JL 

7rn<r^ 

N 


f+a 

■I-/ 


•Jirnx 

-"I'-rv, 


(x' - a;)* 
n<r* dx' 


Jim or J -- 

'-r 


+6 (y' ~ y)* 

n&* dy* 

b 


— * 
e dx' 


e ^^dx* 




.(43). 


Jnira i Jn^a 

^ y' y' •••>} • 

Bei der V erteilung von einem schmalen Streifen, d.h. 6 *= oo , wird der zweite 
Faktor {!}. 

(c) Verteilung von einem Kreis mit dem Radius a um O als Mittelpunkt. 

Die Dichte sei wieder N, Dann ist nach n Fltlgen die Dichte an einem Punkt 
im Abstand c von O: 

f'n (c) = f f (s/c* 4- r* — 2rc cos rdrdd 
J 0 J 0 

— ^rr 

wntT^ Jo Jo 


4- r* - 2rc oob 0 
««r5 


rdrdO (44), 
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2re cos 0 


nach Entwicklung von e ^ in die bekannte e-Reihe : 


- . ic w* ^ or 


.(45). 


wird also JFV (c) = 


Da nun 
wird weiter 


N 

•jrna* 

2ir 


■ e no-' 


/Xf 


e ««r* r i 

27r f^k\ 


J*' C08“+I^d6> = 0, j*' coB"^d(9 = 


.(46). 


2iV^ 


e W<r2 


Die Integrale lassen sich sukzessiv durch Produktintegration auswerten, sodass also : 

{> (rX [■ i im 

(47). 

Die Konvergenz dieser Summe ist bei c® << na^ verbal tnisinassig gut. Man kann 
aber auch zur Auswertung bei Benutzung der vorhergehenden Form Tabellen 
liber die hoheren Momente der Gauss’schen Summenfunktion benutzen (s. Lit.- 
Verz. Nr. (9), S. 22, 23). 

Ist bei den behandelten Verteilungen das Innere zunachst frei von MUcken, 
wahrend ausserhalb der betrachteten Gebiete die Dichte N ist, so ergibt sich die 
Verteilung in das Gebiet hinein gemass (41). 

(d) Verteilung von einem Winkelraum aus. 



Innerhalb des von den Schenkeln des Winkels a abgegrenzten Gebietes herrsche 
wieder eine Dichte N. Dann wird nach n FlUgen an einem Punkt P mit den 
Polarkoordinaten p, die Dichte sein: 

0 aFn (p, ‘^) = ( Vr® + P* — 2rp cob (yjr r dr d4> (48) 

— unter Berilcksichtigung von (46) und d ^ 


•arw®** 

N 
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Wegen der unbedingten Konvergenz dieser Reihe kann man sie aufspalten und 
als Summe der Reihen mit geradem und ungeradem k schreiben. Berllcksichtigt 
man dann noch, dass: 

r _ u\ 

I e n<r2 

^ _ I (49), 

e - r® <*+ « dr = - (»cr*)*+» 

SO wird: 

.r. <P, « - if. - ji [,£j, « «»] 

Die noch verbleibenden Integrale lassen sich auch auswerten (s. Anmerkung*), 
und es wird nach Einsetzung und einfacher Umordnung: 

IT fr. AtI -ul V / P* V .AL_ 




/ \* ^ 

I UW (2A;)! 


cos ([A; — r] [ 2>fr — a]) sin [A? — r] a ^ p 

r-«\^/ ^ — r Vth 




* /2/fc + 1\ (2fc + 1 - 2r) (^y/r sin (2fc + 1 - 2r) | 


X 2:(2^+iV 

r-oV / 


2A? 4- 1 — 2r 


Die Giite der Konvergenz hangt wieder vor allem \on 

FUr a = TT, Jtt erhiilt man, wie man sich durch Einsetzen und Reihen ver- 
gleichung tiberzeugen kann, den nach (42) zu erwartenden Wert^: 

Fn(-p}=.F„(^p, 

* Es iBt : coB» J- ~ (c»» - = (^)** 5 T” ^ ^ 

r»xO / 

==(i)” S (” ) {oosa;(n~2r)-i8inaT(n- 2r)}. 

Da 008 ** fiir reelle x reell ist, miisaen die imaginaren Glieder der Summe 0 ergeben, also : 


cos’* .r = ( 2 { ^ j COB x{n- 2r) , 
r>»o \ ^ / 

(* co8»*rf*= (ir s -?r) I* 

J ^-a. r =«0 v^" / w - 2r 


2 cos (n - 2r)|( ^ ) sin (n ~ 2r) • 


= (i)* s 

r-»0 
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6. Awkreitwng in die dwrck Hindemieae {GMrge^ Waeaer od, dAnl,) 

hegrenzt sind, 

loh beschr&nke mich auf den Fall, dass die mdgliche Ausbreitungsflaofae durch 
eine g^radlinige Qrenze abg^eschlossen ist und mache die Hypothese, dass ein von 
einem Punkt ausgehender Flug an dieser Qeraden gespiegelt wird. Es waren zwar 
noch andere MdgUchkeiten fttr dae Verhalten der Miicken an dem Hindemis 
denkbar, mir erscheint jedoch bei der Annahme rein zufHlliger FlUge die oben 
gemachte Voraussetzung am glaubwUrdigsten. 

(a) Verteilung der N Mllcken von einem Zentrum aus. 

Die Greiizgerade H verlaufe parallel der j^-Achse im Abstand a von dieser. 
Von Qwm(0, 0) aus mdgen wieder N Mticken abfliegen. Dann ist nacb dem ersten 
Flug die Dichte an einem Punkt (a?, y) fUr «?<a: 

N 


dazu kommt die Dichte des Spiegelbildes von {x, y) bezllglich der Geraden H, 
namlich : 


N 


wa 


a 


e 


(a; + 2 [a + 


TTcr* 


(2a-a;)“4*y® 


Also wird die gesamte Dichte: 

(«. y) = 


ir<r* * 


. __ 


(2a-^)s + y\ 


+ e 


(51). 


Die Gesamtzahl der Mllcken ist natlirlich wieder: 


j ^a^ia!,y)dxdy = N (62), 

wie bei der Verteilung (83), was man leicht nachrechnen kann. 

Sei beim^n — l)-ten Flug jetzt die Verteilung gegeban durch a^n~i (jc, y), dann 
erhd,lt man (jc, y), indem von alien FliU^henelementen dx' dy* sich 

a^n-i W>g') da' dy' Mticken 

gemass ^ (^. y) verteilen, sich hberlagem und a^n («# y) ergeben. 


r ) 

Ftlr a=ir, erh&lt man die Formel 

2»(A:!)* 1 * /2Jfc + l\ (-!)»-»* 

(2fc + l)!~2» r<wQ \ + 1 ** ^ * 

da bekanntlioh duroh partielle Integration wird : 

xda:s:2 eoB**'^* xdxss2 * 

J ^wft Jo (24+1)1 

Die obige Formel i«t beim Yergleioh von (50) mit (42) anauwenden. 

Biometrika xxvn 


14 
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Bei der Annahme, daas gilt: 

fUr a: < a : («, y) = 4>„_i {x, y) + 4>„_i (2a - x, y), 

fUr«!>«; a^n-i («, y) = 0, 

folgt durch vollst&ndige Induktion, da nach (61) 

{x, y)’= 4ft {x,y) + ^i (2a- x,y) (61 o) 

ist, dass gilt: 

a^n (a?, y) =/"/ |^n-i(a:'.y') + 4>„_1 (2a - x'. y')j i 

X {4>i (x - x', y — y') + 4>i (2a — x' — x, y — y')} dx' dy’ (63). 

Dies Integral lasst sich in die Summe auB vier Integralen aufapalten und zwar: 

1 . I J<l>„_i(ai', 2/')4>i(a! — aj'.y — y')d®'<^y'- 

2. JY'^n-i (2a - x\ y') 4>i (2a — x'—x,y — y') dx' dy' 

= Jj ^n-i (S'. y') (S' — x,y-y')d^' dy', 

wobei 'la — of =^x*. 

Da nach (29) <I>i ( — a?, y ) =: <E>i (+ ic,y), ergibt 

1. + 2. = j (a;', y') (x — x',y~ y') dx' dy', 

das ist nach (33) == (^^f y)- 

3. J ^<Pn-i(‘i:',y')^x{^a—x'-x,y-y')dx'dy', 

4. j |4>„_i (2a — x', y') 4>i (x — af,y— y') dx' dy' 

= [ (S', y') (® - 2a + S', y - y') ds' dy', 

wobei 2a — a;' «= x' , 

also: 

3. + 4. - (x', y') 4>i (2a - ®' - a:, y - y') dx' dy' , 

das ist nach (33) = (2a ~ x, y). 

Also ist fur (T < a : 

1. + 2. -h a + 4. «= a^n (a?, y) = («> y) + (2tt - a?, y) (53a), 

w.z.b.w. 

(6) Vorteilung von einer mit der Dichte N besetzten Halbebene aus. 

Die Verteilung gehe also von einer durch eine Oerade G begrenzten Halbebene E 
mit der Dichte N aus in ein Gebiet, das durch eine Qerade H, die parallel G sei, 
abgeschlossen ist. 
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Dann ist nach n Flligon an einem Punkt (dem Nullpunkt) im Abstand- c von die 
Dichte : 

aFn (c) = («', /) Cto'd/ 

= j J>t>n («', y') da^ dy' + J j 4>„ (2a — 2c + y’)dx' dy’ (64), 

das ergibt nach (42) 

( V TTW a J ; 

4- iV ■{ 1 I e nar^ dx* >• 

( \wna-J ) 

= (c) + Fn (2a - c). 

Die Werte der Integrate lassen sich wieder aus Tabellen entnehtnen (s. Lit.-Verz. 
Nr. (9) bis <ll)). 

Man kann also (54) erhalten, indem man zu der Halbebene E eine bezUglich H 
spiegelbildliche fiktive Halbebene E' mit der gleichen Dichte N annimmt. 


Analog erhalt man die Verteilung der anderen in § 4 besprochenen Gebiete, 
indem man immer das zu der jeweiligen Grenze H spiegelbildlich gelegene Gebiet 
mit der gleichen Dichte N hinzunimmt und die aus dem wirklichen und dem fiktiven 
Gebiet nach § 4 zu erwartenden Verteilungen addiert. Als Illustration dazu mdchte 
ich noch die Verteilung in einen Halbkreis berechnen. 


(c) Eine mit der Dichte N besetzte Halbebene sei wieder durch ein Hindernis 
H abgeschlossen. Gesncht ist die Verteilung in einen zun&ohst freien Halbkreis 
mit dem Radius R Uber H. 


14—2 
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Nach dem n-ten Flug ist also an einem Punkt (a?, y) (bzw. in Polarkoordinaten : 
p, ‘^) die Dichte : 

J (Ji) J 

= [|4'n(a!' — «.y'— y) + ^n(®-®',y-y')|da!'dy' (56). 

Das Integral ist Uber das ausserhalb des Halbkreises und rechts von H liegende 
Gebiet zu erstrecken. Durch EinfUhriing von Polarkoordinaten : 

00 = p cos yjry y = p sin yfr; x' cos y' = r sin <f> ; 


wird also ; 


rF n ip» ~ I 


+nI2 I r=»- ^pg-2^pcoB(0 -^) 


+ p2 4 - 2rp cos (0 4- 0) 


nach (46) 


|rdrd</!>...( 66 ), 


N 

7rn<r^ 


N un ist aber : 


Jo^ 008‘(^ + 4 ^)d^} dr. 


f4-ir/2 f4“»r/8~0 f4-r/24-0 

I cos* — ylr) dd> *= I cos* <b d<f} « | cos* <b dd> 

J-ir/8 J —trn+^ 


> r'^coe*(0 + ^)d<^ (67), 

J -wt2 
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folglich heben sich die in der Klammer | | stehenden Glieder mit ungeradem 

k heraua, and es bleibt, da ausserdem : 


sFnip, ir) 


N 

0 ri<r’ 


-ifjl-— ,e“6 s ' 

( n<7* jfc-o(A?!)* \n<rV Jo J 




.( 68 ). 


Siehe (47 ), wobei das dortige c dem jetzigen p entapricht. j^F n (p, ist also nur von p 
abhangig und ergibt die gleiche Verteilung, wie wenn die Ausbreitung der MUcken 
von der vollen Umgebting eines Vollkreises vom Radius It in diesen hinein stattfindet. 

Bei den behandelten Verteilungen war bisher von einer Sterblichkeit bzw. 
Vermehrung der MUcken abgesehen. Isfc nun die dadurch bedingte Aenderung der 
Dichte vom Ort unabhangig und ist der Prozentsatz der nach einem Fluge abge- 
schiedenen bzw. hinzugekommenen Mtlcken gleich lOOA, so 6rhd.lt man die 
Verteilung nach dem n-len Flug durch einfache Multiplikation der Dichte mit 
(1 — A)^ Bedeutend komplizierter wird diese Frage, wenn A von Ort zu Ort 
variiert. In seiner ganzen Allgemeinheit l&sst sich dies Problem mathematisch 
wohl kaum Idsen. 


6. Die Verteilung, wenn in einer Malbehene nach jedem Flug ein bestimmter 
Prozentsatz OMSsoheidet, 

Zunachst mdchte ich eine spater zu gebrauchende allgemeine Formel ableiten. 
Bei .der Verteilung von einer Halbebene mit der Dichte N in die andere links davon 
gelegene war nach n — 1 Fliigen die Dichte an einem Punkt {x, y) gemass (42), 
wenn der Nullpunkt auf der Grenzgeraden liegt: 

r 1 ^ 


Nach dem nachsten Flug ist die Dichte: 

(®) - y j («') 4»i («-«', y - y’) dx' dy' 






V7r(» — 1) CT 

das aber ist andererseits nach (42) : 


r-*' - 5-"* - ) 

I e (n - 1) <r* da/* >• dx* 


.(69), 


1 
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also muss gel ten: 




I /•-»' 1 f-*» _ ^ * 

— I e {n-l)<r^dx'*\dx' - e n<fidx* 

J VTrno-J 

(60), 




Oder wenn ich seize 


Vtt (n — 1) 


^ und dann wieder a?" mit 


(n~ l)a* 
a?" und X* mit a*' bezeichne, wird: 


_L, ^ rvw " 1 n - 1 1 


V27 


V27 


e 2 cia?'' 


pa?' 


(61). 


Oder, wenn man aetzt — — — = a und 
Vw 4- 1 


-iJ 


Weiter folgt daraiis, dass gelten muss: 

f r 1 _ {ax'->rh 1 


e V dar' 


j.da:' = J' 


hj'J fi^ ■{■ &* X 

V27r 


e ~ 2 rfa;'(63). 


V27r/3 

(a) Es soli jetzt die Vertcilung von iV’Mticken ausgehend von einem Punkt (a, b) 
nach dem n-ten Flug bestimmt werden, wenn links der y-Achse der Vemichtungs- 
faktor A = 5 — konst, und rechts A = 0 ist, also fiir 

a;<0, A = 8; a?>0, A=0. 

A wird im allgemeinen natilrlich durch Vermehrung und Abstcrben bestimmt und 
ist kleiner beziehungsweise grosser Null, jo nachdem ob das erste oder zweite 
tlberwiegt. 

Nach dem ersten Flug erhalten wir dann folgende Verteilung (nach (33))*: 

<I>i (a; — a, y — 6) «* <I>i (a? — a, y — 6) fiir a; > 0) 

fUr a? < Oj 


.(64). 


: (1 — 3) <I>i (a? — tt, y - b) 

Diese unstetige Funktion <i>i (a: — a, y — b) kann man nun beliebig gut approximieren 
durch 

.(66), 


Oj (a: - a, y - 6) |l — S J e “2 daj'j- 


wenn nur h geuUgend gross gow&hlt wird. 

Ich seize also : 

$i(a! — a,y — 6) = (a!-a,y - 6) |l - s| 


-i.* 1 

- 7 =-i: e 2 dx^ 

V27r 


'} ■ 


(66 «■). 


* Dabei ist allerdings angenomxnen, Vermehrung und Absterben finden immer am Ende eines Fluges 
statt. 
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Die VerteiluDE naoh dem zweiten Plug wird dann bei Verwendung der noch 
abzuleitenden Formel (67): 

— o,y — 6) 

- W - a, y' - V) <I>i (* -x\y- y>) daf dy'jl - S j " “ T 


■ (je — a,y—b)^ ~ ^ J 


-(o+a)^*,^ +»,»«*) 1 


v'2t 


oder bei ki-*-ao wird: 

(a: — o, y — 6) = ^ (a: — o, y — fc) |l — S Q 

r'] + S*J 


{l-8j 


e 2 daf 
1 

V2ir 


} 


a;'* > 

Tdx'V (66), 


- (a + *)/«r 1 




r— 1 n J f 

• I -jrr^e 2 dx 

J V2» 


-(o + »)/<r 1 

-7= e 

V 27r 


dx^ 

\J2 it 


~ f 
2 dx*\ 


« 2 dx^ 


'} 


dabei ist wieder oo zu nehmen. 

Es gilt namlich allgemein: 

^ — a, y' - h) 4>i (i» - fl?', y - y') I ^ ~ ^ 


.(66 a). 




Jl{^ 


1 (a;^ - a)8 + (y ^ - b)^ (« - + {y- yT r - -i. ft) ^ 


ti<r“ 


= f 

TT^n^a^J 


(!/'-b)» + n(p-p')» .f (^'-«)® + n(»-a:0» W4.«) t 


«<r* 




V27r 


e 2 


c 2 


I rfoj'ciy' 

I da?' 


j\r 


/ im - .( y-ft)’'+(*-;;°)* r _» (•-(««'+/») 1 

r«v r J 71 


■7r*w*, 


, a -f na? ^ 
wenn x r-^i — f, 


e 2 cZa?" da?' 

(67). 


n ~h 1 


^»-n (g - a, y - fc) ^ TJ^t^Jl j^e ®2(n+l)' 


J ^2ir 

daraus folgt unter Anwendung von (63): 

= *«+i (® - «. y - ft) j 

K 


_ fT 

e 2 dx* 


jdr. 


-i: 1 


■e ^ dx , 


sl^Tc 

xna -f ott 4» (n + 1) 


Vi(n + 1) {2(n + i) 4-na*<r*} 


wobei 


,( 68 ). 
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Boi Benutsung dieser Formeln folgt aua (66), (66 a), indem man setzt n= 1, /3 ■■ 0, 

a ^ ki ^ QO . 


Da im allgemeinen S -< 1 sein wird, kann man, ohne einen groesen Fehler zu 
begehen, Glieder mit S* und hoherer Ordnung vernachlassigen. 

Mit der Methode der vollstandigen Induktion las«t sich dann zeigen, dass; 



4>n (x — a, 2 / — 6) = On (a; — a, y - 3)|l 

- S S 0(„7«,(a:))] 

Ar-1 ) 

'■ (69), 

%vobei 


a-2 

^ dx 

(70). 

und 

^ V kx -h (n — k) a 

n7*(«)“ - /, ,, 

^ V \nk (w — k) 

fllr k^n 

(71). 


n7n («?) — — Mx, mit Jif 4- 00 


Setzt man n = 2, so sieht man, dass (69) ilbergeht in (66 a), abgesehen von dem 
Glied mit 3* Waiter ist : 

y — ^) = ^ ~ y yO dx* dy* 

X {l-a@(n^.l7n+l(^))} 

~ y') 

(w7A: (^'))| dx* d/ {1 - 5 © (M+l7n+l (^))}, 

das ist unter Berilcksichtigung der Formel (67) bei Integration der einzelnen 
Summanden : 

= 4>„+i {a;-a,y-b)^- („+iyt («))| . 

wobei die Griisaen in der Klammer (70), (71) angeg^bene Bedeutung 

haben. Also gibt (69) die Verteilung nach dem n-ten Flug unter den oben er- 
wahnten Bedingungen an. Allerdings hat diese Formel nur Giiltigkeit wenn S << 1 
sodass die Glieder mit S^, 3*, etc. wegen immer vorhandner Ungenauigkeii 
des nicht ins Qewicht fallen. 

(6) Its sei jetzt die Halbebene rechts der y-Achse mit einer Dichte A" besetzt 
wkhrend links der y-Achse keine Miicken vorhanden seien. Dann wird nach r 
FlUgen bei der vorhergehenden Annahme tiber die Vemichtungsintensitat die 
Dichte an einem Punkt (x, y) sein : 

^ n (^) =» y ~ y') dx* dy* (72). 

Ich setze fUr so\y^ y') 

4>„ («-• 8’‘SS („y^ (w , «'))! <1 - «) ® J 

I * = 1 ) 1 „ x>\f 


( 73 ) 
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eio, das stimmt nach <69) bis auf das quadratdsche Glied, das vemacblftssigt 
werden kann. 

0 und Bind durch (70), (71) definiert mit statt dem dortigen a. 

Els wird also: 

K (^) - {/ y') [l - «*£'© («7»)] da:' dy'| ^ ® J q! 

Dacb (42) 

r:il 

^ ^ . (74). 

dabei ist : 

Pk (a;) = f I JL - « “ ' ' ^ ® ^ e " T da/ (76), 

Jii Wirncr J sl^ir ) 

Pk' (®) - -7=i=- e " ^ c ■ V dr" : ^{-\/ • • -(76), 

y/irntr J n-^k) 

w.b.i — (77). 

Ich seize dann : P' (x) = A (x) — P {x), 

also Pw (^) = A (^) - (x) (78). 

Die .A<*"“^>(fl?) bzw. (/») entstehen durch Dififerentiation der einzelnen Suin- 

nianden, aus denen P' {x) besteht. Es wird also : 

A'/ \ 20! fjft*) 1 -’C J . 1 / 2* ■ , /O! / 2 \ 

(79), 

und allgemein lasst sich dann durch vollstandige Induktion zeigen, dass fUr s > 2 : 
(ar) - - J-5 

WO** 

^ (80), 

und eben«> (o.) - ^ . 

was man durch einfaches Differentiieren verifizieren kann. 

Die Koeffizienten der Potenzreihe in (74) erhlilt man dann, indem man in (75) 
bis (80) a? — 0 setzt. Die Ableitungen von Pif(o) ergeben sich dabei ohne weiteres. 



218 


Ueher Verteilungen von Massenpunkten 


wilhrend P|,(o) auB (76) nooh dutch cine kleine Reohnung vereinfaoht werden 
muss. £s ist namlich, worm ich seize: 


— 

2 

_ L* 


/n^k 




.(81). 




Ich bilde dann : 


dP{a) r 1 -Cf , I f-l 


V 2 ir 


also 


27 r(l+a*) 

P(a). 


Jl fjzL, 

27 rJ 




27 r(l+o*)’ 
1 


arc cotg a + C. 


Nun ist aber nach (75 a) P(a) fUr a = 0 gleich J. Also muss C7 = 0 sein, wenn ich 
den arc cotg auf die Periode 0 bis tt beschranke. Seize ich dann den Wert fur a 
(nach (81)) ein, so wird : 


-P*(o) = ^ arc cotg 


/n — k 

y k~ 


(Periode 0 bis tt), 


weiter nach (76): 


P'{0): 


2 <r Vlt ( Vw - Ic Vw 


}■ 


Allgemein erhalt man fiir die Nullstellen der Ableitungen mit ungeradem Index : 

P („) = (-2)>-^1.3^5.7-(2^-l) f 1 _JL I 

Vir<r*®+^ ((n — A:)* Vw — A; n* sjn) 

FUr die Nullstellen der Ableitungen mit geradem Index ist mir keine allgemeine 
Darstellung gegliickt. Es ist jedoch : 

p iv/^\ ^ 2 (Sn 2A?) / ^ 

-TT//* (71 — k)a^^ n — k* 

« ^ ^ - 4 (15/1* ~ 20nk + 8A:*) 

“ 7rn»(n-A:)*<r* W ri:^k* 

r>vju/ \ 24(35//*— 70a*A:4-56nA;*— 16A:*) / A; 

^»«(n-l:)»<t<» ^ V • 

-P*‘(o) 


(n — A?)* < 

- 48 (315n^ - 840a*A; + 1008n*A:* - 57 6nk^ + 128A:*) /“T* 

V n-k* 


irn^(n — ky<P^ 


Unbestimmt integriert 
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Die praktisohe Auswertuug dieeer Beeultate wird vor allem von der Kenntnis 
der Griisse cr abh&ngen. Um dies cr zu bestimmen, wird man wohl von N MUcken 
einer bestimmten Art A experimentell eine Verteilung nach einem oder mehre> 
ren FlUgen, ausgehend von einem Zentrum Q, beobachten mtissen entsprechend 
Formel (33). Nachdem dann einmal von einer Milckenart A mit einer Flugzeit tx 
und einer linearen Fluggeschwindigkeit vx die Streuung des Abstandes von einem 
Punkt bekannt ist, kann man mit einiger Berechtigung die Streuung einer anderen 
MUckenart B mit einer Flugzeit und einer linearen Fluggeschwindigkeit v% 
berechnen, wie auf Seite 204—205 naher ausgefUhrt wurde. 

Im n&chsten Paragraphen soil nun an Hand einiger Beispiele die ungefahre 
QrOsse des a der Anopheles quadrimaculatus bestimmt werden. 

7. Bestimmung der ** Streuung'* filr Anopheles quadrimaculatus nach Beobach-- 
tungen von M. H, Barber und T, B, Hayne, 

Es muss hier allerdings von vomherein betont werden, dass es sich bei den 
folgenden Berechnungen nur um rohe Naher ungswerte handeln kann, da erstens 
die Zahl der zur VerfUgung stehenden Beobaohtungen zu klein ist und zweitens 
die auszuwertenden Versuche nicht eigen tlich zu dem hier erstrebten Zweck 
angestellt wurden. Prinzipiell hatte roan an verschiedenen Punkten und Tagen 
(wenn pro Tag ein Flug stattfindet) die Dichte der Miicken zu beobachten, wenn 
am Anfang des ersten Tages an einer bestimmten Stelle eine rnoglichst grosse Zahl 
gefarbter Miicken in Freiheit gesetzt wurden. 

Aus den so gefundenen Daten ware dann mit Hilfe der Ausgleichsrechnung aus 
Formel (33) das a zu berechnen. 

In der Praxis wird diese Bestimmung der Dichte an den verschiedenen Punkten 
nun ziemlich schwierig sein, da man im allgemeinen wohl kaum eine bcstiramte 
Flache ganz genau absuchen kann und da wegen Haufungen der Muckenanzahl in 
Stall ungen, Wohnungen usw. zur Berechnung der Dichte an einem Punkt eine 
etwas andere als die abgesuchte FllU^he anzusetzen ist. 

Ich fUhre deshalb als neue unbekannte Grosse die Flache F ein. Unter der 
Voraussetzung, dass F verbal tnismassig klein gegen ist, ist dann nach (33) die 
in der Entfemung r nach dem w-ten Flug auf F behndlich Anzahl Miicken : 

(1 — FH - 

(r) - (1 ~ Sr F4>^ (r) = e (82). 

Es sei also der Vernichtungsfaktor S Ortlich und zei tlich konstant. Er gibt den 
nach einem Fluge ausgeschiedenen Bruchteil von der am Anfang des Fluges 
vorhanden gewesenen Gesamtzahl der MUcken an. 

Bei den Versuchen hat man also an mOglichst vielen Stellen, rnoglichst gleich 
grosse Flachen F gleich gut abzusuchen (in dem F ist mit berUcksichtigt, dass von 
den wirklich vorhandenen Miicken nur ein Teil gefunden wird; insofem ist also F 
praktisch nie die wirklich abgesuchte Fl&che). Die Unbekannten F, <r* und eventuell 
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B sind dann so zu bestimmen, dass sich die Zahlen der an den einzelnen Stellen 
tatsachlich gefundenen Mticken mdglicbst gut den Zn (r) der Formel (82) anpassen. 
£s ist nun iiblich, die Unbekannten so zu bestimmen, dass nicht die beobachteten 
Zahlen durch Z„(r), sondem dass log j5n(r)durch logZn(r) mdglichst gut 

approximiert wird, da dadurch ein bedeutend gleichm&ssigerer Einfluss aller beo- 
bachteten Daten erreicht wird. Nach (82) ist nun: 


log (r) = log ^ + w log (1 - 8) + log ^ 1 

Ich setze dann : 

log(l — S) — 

iog^=y[ 


1 



(83). 


(84), 


und suche nach der von Gauss herruhrenden Methods der kleinsten Quadrate 
(s. Lit.-Verz. Nr. ( 10 ) bis ( 14 »)), die a?, y, z so zu bestimmen, dass 


S (log (r) - log (V))» 

ein Minimum wird. 

Von Barber und Hayne sind nun zwei Versuche ausgefUhrt worden, die ungefclhr 
den hier verlangten Bedingungen entsprechen (s. Lit.-Verz. Nr. ( 15 )). 


TABELLE I. 



0 

14*4 

15 

1 

50 

1 

1 


( 1 * 699 ) 

( 0 ) 


2 

24 

1 

1 


( 1 - 380 ) 

( 0 ) 

( 0 ). 

3 

10 

1 

1 


( 1 ) 

( 0 ) 

( 0 ) 

4 

24 

1 

1 


( 1 - 380 ) 

( 0 ) 

( 0 ) 

5 

4 




( 0 * 602 ) 




Dio Zahlen in den Klammern sind die Briggs’schen Logarithmen der dartlber 
stehenden Zahlen. r ist in 100 Fuss (=* 30*48 m.) gemessen. 

Tabelle I gibt von 2500 gefilrbten ausgesetzten Mticken die Verteilung der in 
verschiedenen Entfernungen und an verschiedenen Tagen Wiedergefundenen an. 
Die 2^hlen am Kopf der Tabelle sind die Entfernungen in 100 Fuss, w&hrend links 
die Anzahl der seit der Aussetzung verstrichenen Tage steht. 
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Setzt nuui nun ftlr die <p, y, m vom (84) in Fortnel ( 8 S) die N'^erungswerte : 

®e- 0 . yo-- 2 , zo = 0 ( 86 ), 

so erhftlt man 18 sogenannte Fehlergleichungen: 

+ly - r f - ^log Bn (r) - log ^ + 2 

** — 1 r 

in die die aus Tabelle I zu entnehmenden Werte einzusetzen sind, also: 

+ 0-.0-798, 

0-0-780, 

If + 117-90 4-0-901, 
f ;7 = 2f + li 7 - 45 + 0-600, 

tisw. 

I^ie f, v» if sind die an Wy y, z anzubringenden Verbesserungen, sind nun so 
zu bestimmen, dass \yv\^ ein Minimum wird. 

Dazu bildet man die Summen der Produkte der in den Fehlergleichungen 
stehenden KoefHzienten a,,, h^y c„ und l„y aus denen man dann die sogenannten 
N ormalgleich ungen erh&l t : 

[oa] f + [a 6 ] 71 + [ac] f ■» [aZ] 

[ 6 a] f + [ 66 ] 17 + [ 6 c] f « [ 6 Z] (87). 

[oa] f + [c 6 ] 17 + [cc] f ** [cZ] 

Die Ldsung dieses GleiohungsSystems gibt die zu den N&herungswerten hinzuzu- 
fiigenden Verbesserungen an. 

Die Normalgleichungen unseres Beispiels lauten: 

115 f+ 35 17 - 750-4 1-272, 

35 f+ 13 17 - 390*8?= -0-813, 

- 750-4 f - 390*8 17 + 25070 f = 265*9. 

Die Ldsungen sind: 

f« -0-123; 17-0-89; ?= 0*021. 

Also wird nach (85) : 

a: --0123; y--! !!; 0*021. 

FUr die mittleren Fehler ergeben sich die Werte : 

= ± 0 088 ; — ± 0*32 ; = ± 0-0035. 

* Es bedentet: [aa]ss2ara„, usw. 

EinBelheiten beziigUoh d«r Al»l 0 iiaiig der Theorie der Methode der kleinsten Quadrate sind u.a. in den 
aof Seite 236 angegebenen Warken su dnden. 
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Nach (84) erhalt man dann die gesuchten Grdsaen : 

1 -S = 0-76 ± 016*; ^ = 0078 ± 0 057 ; 

<r = (6-9 ± 0-6) 100 Fuss - (210 ± 20) m. 

Wie nach der Ausftihrung des Versuches zu erwarten, ist die Ungenauigkeit 
von F am grdssten. 

Der Wert fur 1 — 3 stimrnt ausserordentlich gut mit auf ganz andcre Weise 
angestellten Berechnungen von Herrn Prof. Martini uberein (s. Iiit.-Verz. Nr. ( 16 )). 

Aus O' folgt nach (31) : 

^ ^ = 186 m. 

Prof. Pearson (s. Lit.-Verz. Nr. (8), S. 36) schatzte nun die Qrosse von I zwischen 
160 und 500 Yard (91*4 bis 457*2 m.) in guter tJbereinstimmung mit dem hier 
gefundenen Wert. 


TABELLE 11. 


X "■ 

n\r 

0 

14*4 

15 


Beob. 

Thoor. 





1 

50 

46*5 

1 

0*6 

1 

0*4 

2 

24 

17*5 

1 

2*0 

1 

1*6 

3 

10 

8'8 

1 

2*1 

1 

1*8 

4 

(24 

4*9) 

1 

1*7 

1 

1*5 

5 

4 

2*9 


— 


— 


Tabelle II gibt eine Gegeniiberstellung der beobachteten (links in den Spalten) 
mit den aus (82) nach Einsetzung der oben gefundenen Zahlen berechneten 
Werten. 

Ein Mass fiir die Giite der Anpassung ist die von Pearson stammende 
Methode (s. Lit.-Verz. Nr. (18), S. 157 — 75, und Nr. (9), S. xxxi). Man hat zu 
diesem Zweck zu bilden : 

2 _ 2 Wert — theor. Wert)^ 

^ ~ theor. Wert 

* Man erhalt die mittloron Fehler aus denen fiir z mit Hilfe des Gauss^sohen Fehlerfortpflanzung- 

gesetzes : 

«»/- *\/ (»». + (»h, ij)’+ ••• • 

Ttif ist der mittlere Fehler einer Funktion / (a;, y, ...), wenn nt^p, wty , ... die voneinander unabhangigen 
mittleren Fehler von x, y, ... sind. Also z.B. wenn 

/(X. y, ...) = l-a = 10* 

ist, wird ; m/= ± 0*088 0-16. 

(Die Zahlenwerte sind aus obigem Yersuoh genommen.) 
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Ftir jede betrachtete Anzahl von Klassen entspricht dann den verschiedenen 
Werten von x* Wahrscheinlichkeit dafUr, dass eine durch Zufall 

entstehende Abweichung der beobachteten von der theoietischen Verteilung 
i^banso gross oder grosser ist als die der untersuchten Verteilung. Die zu den 
verschiedenen x* geh5rigen Werte P sind tabelliert (s. Lit.-Verz. Nr. (9), S. 26, 
und Nr. ( 19 )). 

In unserem Beispiel erhillt man bei ausser acht lassen des ganz aus der Beihe 
fallenden Wertes fiir a « 4, r — 0 fiir den Wert 6*54, dem entspricht : 

P = 0-8. 

Die Anpassung ist also als Uberaus gut zu bezeichnen. Man nennt n&mlich im 
allgemeinen eine Anpassung noch gut, solange p > 0*1 (naheres hierUber siehe 
Lit.-Verz. Nr. (ao), S. 104 — 8). 


Der zweite Verauch van Barber und Hayne, 
TABELLE III. 



Tabelle III gibt diesmal die Verteilung von 2400 MUcken. Sonst ist die 
Bedeutung der Zahlen die gleiche wie die der Tabelle I des ersten Versuches 
(S. 220). Wie man ohne weiteres sieht, sind die wiodergefundenen MUcken diesmal 
bedeutend regelloser verteilt, und es ist anzunehmeu, dass durch das Absuchen 
Oder sonstwie systematische Fehler in bezug auf die Anzahl der an den einzelnen 
Stellen wirklich vorhanden gewesenen Miicken vorgekommen sind. Behandelt 
man nun die Daten dieses zweiten Versuches genau wie die des ersten, so erh&lt 
man : 

1 _ s ■ 1024 ; ^ - 0 009 ; o- - 9-8 100 Fuss, 

Der Wert fiir 1 — 6 ist jedoch unmdglich, denn auf jeden Fall muss 1 — 6 < 1 sein. 
Ich habe deshalb bei der Auswertung des zweiten Versuches die Qrdsse 1 — ^ als 
bekannt gleich 0*7 angesetzb, welchen Wert man auf Grund des vorigen Versuches 
und der erw&hnten Berechnungen von Prof. Martini wohl als einigermassen richtig 
ansehen kann. 
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Sotzt man also in (82) 1 — 5 « 0*70, 

und fiihrt im iibrigen die analoge Rechnung wie beim ersten Versuch mit den 
entsprechenden Zahlen der Tabelle III durch, so erhalt man als Normal- 
gleichun^en : 

12 104 2?- 2-812. 

104-2^-8222 ?=74-8, 


mit den LOsungen : rf — 0*35 ; ?= 0*0135. 

Also wegen (85) : y = — 1*85 ; z « 0 01 35. 

Die ebenfalls aus den Koeffizienten der Normalgleichungen zu berechnenden mitt> 
leren Fehler sind ; 

rriy ■« ± 0*24 ; — ± 0*0092. 


Wegen (84) wird dann endlich : 

^ = 0 0224 ± 0-0124 ; <r ■= (8-6 ± 2-9) 100 Fuss «= (260 ± 90) m. 

Wie man sieht ist mit den Resultaten des ersten Versuches eine weitgehende 
Uebereinstimmung innerhalb des Bereiches der mittleren Fehler vorhanden. 


TABELLE IV. 



0 

1 

2 

5 

14-4 

1 

277 

12-0 

1 

11*8 

1 11*3 

1 8*5 

1 0*75 

2 

29 

4*2 

4 

4*2 

2 4*1 

— 


3 

4 

2 0 


— ■ 

— 

— 

— 

4 

2 

1-0 

1 

1*9 

— 

— 

— 

6 

1 

0*3 


— 

— 

— 

— 


Diese Tabelle gibt wieder eine Gegentiberstellung der theoretischen (rechts in 
den Spalten) mit den beobachteten (links in den Spalten) Zahlen. Man sieht 
sofort, dass im ganzen genommen eine sehr schlechte Anpa^ung der theoretischen 
an die beobachteten Da ten vorliegt. Ein Vergleich mit Hilfe der erwahnten x* 
Methode ergibt fttr P einen weit kleineren Wert als 0*0001. Die 5 starksten 
Abweichungen sind nun prozentual im Verbal tnis zu den Ubrigen derartig gross, 
dass fUr sie ein systematischer Fehler anzunehmen ist. Man ist deshalb in ge- 
wissem Sinne berechtigt, diese bei Beurteilung der AnpassungsgUte auszuschalten. 
FUr die dann Ubrigbleibenden 7 Beobachtungen erh&lt man : 

X^ as 6*2 das gibt : P = 0*4, 
was immerhin noch sehr gut genannt werden muss. 

Man kann also mit einigem Recht auf Grund dieser beiden Versuche ftlr <r 
einen Wert von etwa 250 m. ansetzen und damit die in dieser Arbeit behandelten 
Verteilungen fUr die Anopheles qiiadrimaculatus zahlenmassig berechnen. In 
folgender Skizze babe ich z.B. die Verteilung von einer Halbebene mit der Dichte 
100 in die andere nach 8 Flttgen dargestellt. 
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Die Abzissen sind die Abstande von der Qrenzgeraden, wfthrend die Ordinaten 
die in den verschiedenen Entfernnngen vorhandonen Dichten, dh. Anzahl pro 
Flacheneinheit, angeben. Die Langeneinheit sei 100 m. (Flacheneinheit also 
10,000 m.*). Vermehrung und Absterbeo sollen sich aufheben, also $ « 0. 

Dann wird die Dichte im Abstand m x 100 m. von der Grenzlinie nach 8 FlUgen 
zufolge(42): 

— 7 =— f (mitft—8, <r«»2*6) 

•Jimtr J 



s” 2 dx\ 


Die Werte dieses Integrals babe ioh aus den Fearson'schen Tabellen entnommen 
(s. Lit.-Verz. Nr. (»), S. 2 bis 9). 


I « 2 ist dort mit J (1 — a) bezeiohnet. 
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Der zweite Kurvenzug ist die Verlangerung des ersten bei 50-fach vergrbsser- 
tem Massstab. 

Herm Prof. Dr P. Kiebesell sowie Herm Pro£ Dr K Martini mbchte ich auch 
noch an dieser Stelle meinen besonderen Dank aussprechen fUr das rege Interesse, 
dass sie an der Fertigstellung dieser Arbeit stets bewiesen haben. 
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THOUGHTS SUGGESTED BY THE PAPERS OF MESSRS 
WELCH AND KOLODZIEJCZYK {Biometrika, VoL xxvii. 

pp. 145—190). 

By KARL PEARSON. 

These authors take populations ITi, I!, ... n< ... IT* and suppose that in any 
one of these k populations, the distribution of the y's for a given os is normal and 
homoBcedastic. Accordingly the frequency surfaces must be of the form 

4>i ^ ~ (i), 

where a®.* is the array standard deviation, and Si and fit the constants in the 
population n<, and nothing is stated about the form of <f>i(os), which A priori may 
be considered continuous or discontinuous. For the truth of the following results 
it does not matter whether we take a summation or integrate with regard to w 
between arbitrary limits. But the reader must bear in mind that we are summing 
for the parent population and not for any sample taken from it. We shall use 
7 ni^i and for the x and y means of the population, and a^^i and ^ 2,4 for their 
standard deviations, pi will be their correlation coefficient. For a sample from this 
population, we shall use y< for the means for the standard deviations 

and f'i for the correlation coefficient, and for an array standard deviation of the 
y’s for a given Xi, 

the mean of the x array of y*s the form of (i) shows us at once that 

== (ii). 

and there is accordingly a straight regression line for y on x. 

Summing for all values of x, it follows whatever be the form of 4>i(x) that 

mt,i=‘S( + 

We will now show that for any surface given by (i) 

We have, if Ni be the size of the population 11^, 

wm { (y — a< — )§*« + {?< + dy 

J - CO 

« S„x^i {x) [0 + + ffiOoi V 

« iSiVij^i + 0 i (a^i^i + m\i)] S^<l>i (x) 


16—2 
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On the Welch- Kolodzi^czyk Frequency Surface 

But Ni and therefore by (ii) 

F I ~ Si (Ao* »,i + 

or fit •“ (“)> 

whatever be the function 4>i{w), 

We may now proceed further and ask what is the form of for a sur&ce 
like (i)? 

We will find 

(<^\t + »»**.<) “ (®) v'2Tr (a-\t + fn\t) 

- <S'« 4>t (w) r”(i/-at-fitCP-hSt + fitwf e ~ dy 

J - oo 

= S„tp2(‘e) [^o.< + 0 + («<* 4- ZSt/StO! + fitO^) V27r<ra^ J. 

Thus Ni {<T\t + '«*»,*) = St [o*o,< +«<* + ^ifit”H.i + fit* (5*1.4 + w»*i, 4 )]. 

Hence by (ii) and (iii) 

+ »"*«.. * 5 *«.« + (^ + fit»H.,y + fi*^i.i, 

<^%i (1 - Pi*) = 5*0, 4 (iv), 

which is exactly the form 5*0,4 would take for a normal surface, and this whatever 
be the form of 

Now if an additional hypothesis be made that is to be the same for all the 
k populations TIi, Ila... n<... 11^, then for those populations we must have 



which signifies that for these k populations the variance of y in each of them varies 
inversely as unity minus the square of its correlation coefficient. It would seem 
difficult to discover any series of populations for which such a relation would hold. 
Personally I am not able to think of any series of populations for which the 
variance and the correlation would obey such an artificial, not to say ftmtastic, 
relationship except : 

(a) a series of populations with the same variance for y and the same correlation 
coefficient, i.e. 

Pi ■“ pi' values of i and t' (vi), 

or, (b) a series of populations taken from different parts of one and the same 
population. This would involve not only (vi), but and being the same, to say 
nothing of and as well as 

5^i,i » for s*!! values of t and (vii). 

Thus case (b) involves far more stringent hypotheses than (a). 

But if we accept (a) and then superpose upon it the additional hypothesis that 
the regression coefficient of these populations, namely is to be constant, we 
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have fixed BXid accordingly must be considered constant in this case, 

(a) therefore connotes that the three constants p^, are fixed for all k sur- 

faces, but the means may be different, which is not possible in (6), 

But we can now go further. Clearly — j 5 <*) will be constant for all k 

populations and accordingly the arrays of a? on y will be homoscedastic. This does 
not refer to the samples taken from those populations which may not be random 
at all, but to the populations IIi, Hg ... ... Ilg themselves. 


Now the most general equation to a surface having homoscedasticity both 


ways* IS 


' Zoe 




(viii), 


where 7, mi, m%, Xt, X,, ci" and c," are arbitrary constants. This has for its 
regression curves 



S^.-C-ilog(l-^w,s^x-) 

(ix) 

and 

®^»ci-llog(l-^mxe^«') 



Now by hypothesis (i) we are to have (ix) linear. 

Let us expand the logarithm and the exponential. We have 

= Ct + ^ 4- Xi 0? + ^ + . . .^ 

+ "h 8X1 a? + ^ 9 Xi*a?* 4- . . .^ 


4 - etc. 
s* c* 4* m%, 


(i) 


. mg*Xa . Wa*Xg* . 

2X1* 3 Xi« 


4-fl7^mg4- wg*^4-wig*^4- ...^ 

+ 2 I 2mt*Xi 4- 3mg® 4- . . 


4- etc. 


To make the right-hand side linear we are bound to make Xa/Xi finite, and both 
Xi and Xt zero. The infinite constant now may be added to the arbitrary constant 
c%; the coefficients of all powers of x vanish except that of x which is finite and we 
have 

+'nH*x (xi). 


* S. Nanimi, Bi metrika, Vol. xv. jqj. 82 — 85. 
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But the very same values of \i and X® turn (x) into 

Xy = Cl' + wii'y 

Accordingly we cannot have a frequency surface homoscedastic both ways and 
with linear regression one way, without linear regression the other way. 

But a frequency surface linear both ways and homoscedastic both ways can 
only be a bivariate normal surface. If this result be not familiar, it can be deduced 
from (viii) by making the ratio \2/Xj finite and Xi and X* both zero. 

Thus it appears to me, without being dogmatic, that unless a series of surfaces 
can be produced in which the relation (v) holds other than those supplied by 
(a) or (6), we do not narrow our hypothesis by supposing that the populations 111, 
TTj, ... TT^ ... Tlife are all normal surfaces differing the one from the other only in 
the position of their means. These means will not of necessity be collinear; this 
depends on the constancy of supposing that we have accepted the hypotheses 
that A a-re the same for all the populations. 

The thoughts here recorded do not question the validity of the processes by 
which the authors cited would deduce the constancy of or But they do 

tend to suggest that the surface (i) from which they start is, when the hypotheses 
~ ^o.<' a-nd = are applied, not so general as they, perhaps, imagine and 
that accordingly there may be other, possibly as simple, methods of attacking their 
problem as that provided by them. 

Let us consider first samples which may be supposed taken from a bivariate 
parent normal surface defined by means mi, m2, <ri, 0-2, and correlation p, and let 
any sample be defined by y, Si, Sa* and r. The frequency surface in five-way 
space is known to be^ 

_ 1 + ( y - 


X e 


_ 1 JSl® . 
2 


- _ _ a 




Jkf-2 ^ _ 

(! — ?'*) 2 \dxdyd%xd^%dr\ 


= ^0-^1 (^, y) X J^a (2i , Sa, [d^idyidSidSadr] say, for brevity (xiii). 

There are five variables here, i.e. those of the sample 5 ?, y, 2a, I propose to replace 
them by the five other variables x, a = y — /3 r 2 a/ 2 )i , 2 i , and 7 « 2 a Vl — +. 

In doing this I shall throw any numerical factor or any function of the parental 
constants mi, ma, si , S2 and p only, which arises as a multiplier of Ft or into Zq. 

I shall begin first by getting rid of y in Ft (x, y). After re-arranging 

in powers of x, we have 


Ft{x,a,fi) — e 2 L^| 8 ^q | 








where 


( 2-1 


^P^Sx ^ $*8x* ^ ^pfio-x ^ a, 

8% s,* at <r,* 


(xiv). 


* We write (1 ^ and = -‘p^)/Mt where M is the size of the sample, 

t The reader must be careful to bear in mind that a, /9, 7 here are all sample and not parent- 
population values. 
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Now m does not occur in Sit &nd thus we see if a and are fixed 

W still follows a normal curve round the value 



mean q (xv), 

with standard deviation **si/v^Q (xvi). 


We see that this standard deviation is independent of a, which only influences m. 
We now transform our five variables p. Si, Si, r to the variables a, 2i, and 7. 
Evaluating the determinant we find 

^ 

dMdyd^xd^%dr ^ ^ •Jl-^dxdadZxdfidy, 

We can now integrate out for W and have a four-way frequency surface : 

2 VO =vT^J^,(2i,Sa,r) [dad^xdfidyl 

1 T*^ 

The next stage is to express the function ^ Ft (Si , Si, r) in terms of 

Si, /3 and 7. What we have to express is 


e 


2Vv 


8-1 ta “ * 


and this leads to 


or 


'iS) 


Afj-8 
2 , 


Jtf-S 




\ 8 i/ 


2V 


\M -3 


We can now write our fourfold surface of frequency in the form 

1 -.1 (< * - ~~ p“) 

z' dadXxd^dy^ zq -j=.e 2 VO dad^ 

V Q 


X c 2 *1* 




cZSie 




lf-8 


(^7 (xvii). 


This admits at once of being integrated out with regard to Si, leading us to the 
complete F- function and other factors which can be thrown into the ^o^ 8 lb 

well as the important factor • Thus we reduce our expression to a three-way 
surfikce of frequency 


1 1 (tt - (1 - p*) « i Ilf /rk/\JJf-8 

F* dad^dy =* Zt* e 2 VO da dfi x c “ 2 V dy, . .(x viii). 

This result is of considerable importance ; we see that the three-way frequency 
surface breaks up into two independent frequency distributions, the one the 
bivariate frequency surface of correlation for a and and the other the frequency 
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curv e for the distribution of 7, the standard deviation of the arrays of y on i.e. 
2a Vl — r*. This latter^ as pointed out in my memoir of 1926*, is precisely of the 
same form as the distribution of the standard deviations of samples from a normal 
curve with the number of the individuals sampled reduced by a unit. 

It is accordingly clear that in the problems considered by Messrs Welch and 
Kdodziejczyk the question of whether their at (= our aa,<) in a series of samples 
is the same can be treated without regard to the a and /S of the regression line, 
while the a and p cannot be considered separately as they are correlated. We 
shall consider later how to deal with this distribution of cr^<. We now turn to the 
correlation surface 

e'" ^ 2 «,*« [dad/3] (xix). 

This can be integrated out with regard to a, and gives us 



This is the distribution curve for the regression coefficients 0 of samples of size 
M drawn from a bivariate normal surface. It is a curve of my Type VII, symmetrical 
about the regression coefficient of the parent population, pa^/ai , and as M increases 
it approaches a normal distribution f. Remembering that a^fax we remark 


that 

^ = pa-t/crx , (Tf = ^^=1 If (xxi). 

The odd moments all vanish and ^ (M — S)I{M — 6), 


For low values of M the moments of the P curve become infinite. For M^aS 
fjL 2 is infinite, and for Jf « 4 the value of ^4 is infinite. Generally M must be > n + 2 
for the nth moment to be finite ; i.e. if we desire the nth moment to be finite, the 
number in the sample on which the regression line is based must be > n 4 - 2. 

Returning to (xix) we see that it may be written 

z”' = Zo'’'e 2 (««)• 

Since Q is a function only of P we see that for constant regression coefficients 
a will be distributed normally about the 

Mean Pmx , 


* R.S, Proc. Vol. 112 A, p. 4. 

t I disoovered this distribation in 1926 (see M.S, Proc. Yol. 112 A, pp. 6 — 7), but Mr Welch has 
pointed out to me the proof then given was not valid. 
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with 




< »~i^ 

VJJf 


O'* 


(aa* — 2/o^a'iO‘a + )8*<rx*)^ 


Thus we learn that the regression of a on /3 is linear, but the 
scedastic, the variance being given by a hyperbola. 


(xxiii). 

system is hetero- 


I have tried unsuccessfully to integrate (xxii) for ff, so as to obtain the distribu- 
tion curve for a. This I fear is unlikely to be achieved, but the moments of the 
a curve can be found, and if the reader he so inclined a curve of one of my types 
fitted from the first four moments. This is the method adopted in a similar problem 
by Mr Welch (p. 161 ), but in the case of samples based on few individuals, I feel 
doubtful how far this approximation to the true distribution curve of a variate 
may not be misleading. In the case of many individuals it would for practical 
purposes be adequate. 


We will turn now to the moments of a. First we will find the value of Zo” 
supposing the total frequency to be N, We have 

r+oo r+00 _ 1 — + (g'2/<yi) 4 - wtj 1 


and 


R2' * — p 


f!? 

ori ' 


Integrating out with regard to a, 

r'<-”V2w<r,VQ 1 


N^za'‘ 


Vif Q*(w+i) 


Vlwo-sV" 1 


V 3 r 






dR^' 

*J% 1 T 




,Af+l r+oo 


W 1- « (c* + 

0*1 ^ 




where 

Hence 


JV = V2ir — (t (M— 1 )) 

<ri (1 

N<ri (1 - p^jkw-t) V jg r Jf) 


«0 


V27ro-s*r(i(Jlf-l)) 


.(xxiv). 


While considering the value of the z^b it is worth while writing down what 
value arises from the 7 distribution — i.e. 

1 V* 


- Z JL. 


where *» <ra*(l — p*)/Jf— of the three-way frequency surface. 
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1 • 

Putting V = H 2 » integrating from 0 to oo , and taking the total frequency to 
^ ^2 

be unity we find 

Vat 


or the distribution of 7 , the array standard deviation, is given by 

\I1S 

■ e 2 




.(xxv). 


<ra VT - r U if - 1 ) 

We are now in a position to write down the complete three-way surface for the 
frequency distribution of a, ^ 9 , 7 . It is 

, ^ (1 - 1 -.A 


My* 


Qi(Af+l) 


1 My* y \/My 

X e a.rsMi-p’) ( fr=^a) Ldadfidy} (xxvi). 

\<ra V 1 - P®/ 


For the two-way surface of a and /3 only 


^ iV Vi (1_- p ®) * r (jM) - 1 1_ 


V 2 Wr(i(ilf^^l)) — ^ ^f^[d«d^]...(xxvii). 

The importance of these forms is that for any hypothesis as to a, /3, 7 we may 
require to find the best values to take for mx, 7n%, <ri, cr^y and p in order to give 
the most probable values to the observed quantities'**'. 

We now return to the moments of a, applying formula (xxvii). It is necessary 

to find 


r+00 r-^ a 

ind J j 

J —00 J — O 


zadadp to obtain the mean of a, or a. Now 


Na = V 


/•+00 f +00 l (a-TOa-t-/3i>ti)2 

{(a — -h ySmi) 4- (w* — e 2 a^~QjM 


a: 


and if wo write X ~ a — wg 4- /Smx, 




dad/3, 


M-ao f 4 -oo ^ 1 

^ J^^{Jr + (m2^y8mi)}e 2a^QiM.—^^dXd/3, 

the integration of the first term in the curled brackets with regard to X is zero, 
and of the second term is V27r v^Q/ Af o-g x (mg — /9mi). Accordingly 


Na = Zo" j ^ IT, (m, - /9mx) d/3 

<r, j ^ — wiip ■ 


' V27r 


diJg' 


VA/ 




where i 2 g' = yS — p ~ as before. 

O’! 

* The method was first used by Gauss to demonstrate the arithmetic mean of observations and the 
best value of the standard deviation in the case of the normal curve. It was largely applied by Professor 
Filon and myself in dealing with the variability of frequency constants in samples (Phil. Trans. (1898), 
Vol. 198 A, pp. 2Sl-~88). 
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Since the Ba curve is symmetrical, the integral involving the Rt term in the 
curled brackets vanishes, and we have 

*o'"V2^ 




af r+oo 


Take JBa' *= ^ Vl — p* ,y / ^ , and we find 


dRt' 


(^.0 


IS- 




-( 3 ) 


-») dw 


1 r(^>r^(if-i) 

(i_p«)*w-w r(i3f) 


V2 


TT era 


and accordingly 

ax. ,,, V27rera* 1 ri(if~l)/ 

Vi/ .ri (1 - r(ii/) «ri, 

Substituting for Zf!” from (xxiv), we have 


ri(i/-i) 


-)• 

O'!/ 


^ Tna — 


o“i 


.(xxviii). 


This checks the value of giving to the mean a the population value. But it 
is not obvious d priori that the mean value of the triple product will take 

the population value. 

We will now proceed to compute er«, the standard deviation of a, 

r + co r + oo 
J — CO J — oo 


r + oor+oo 

za^da^zo'" {a - ma 4- /Swi + ma - e 2 

J — ooV - CO 

1 


1 (g ~ wta -f 

dad/S. 


Qi(Af+l) 

Writing a — wa + = .ST as before, the curled bracket term gives us 

X* 4- 2 (ma — /Swii) JT 4- (rwa — ; 

1 Jfg 

this has to be multiplied by e and first integrated with regard to X. 

This gives us 

V2»ir, y / + 0 + (mt — ^Snii)* V^<r, y/ ^ , 

i:: 

and this after some rearranging and introduction of i2a' « >9 -- p — 


+ R»‘(^ + rm* 


KW 


')}(sy 


dfi,' 

(c» + R,'*)i^ • 
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The term in the curled brackets in will disappear on integration, since the R% 
frequency is symmetrical. Hence 

V27r 


rs 


za^dadfi 




V2' 


iTT cr^Zo 


But 


{?f [ 3 - — I (1 - p‘)} f_ 


dRi' 

-00 (C» -f iKa'*)*" 
dR2' 


and 


i 


« (c* + 

dRt' _cP^/wr(i(M-l)) M-2 
+ ■ r(iM) JI/-3' 




(xxiz). 


i: 


Substituting in (xxix) these values, and also from (xxiv) the value of zo"\ we 
have 

- « <'•• + 5-) - if 


Or, sinde d « '-rrixp^ , we have 

« Jlf -- 3 


1 + 


Wi* 

0*1* 


1 


) 


.(xxx). 


The approximate values given by me for large samples in 1913* were, for a and >9, 




and 

O'! 




Jf 


f5! l_np* . 
' cr,a ^ - 3 ’ 


as against the exact values 

« <ra*(l~p*)/, nit^ 2\ , . . 

” M - 3 V <r,* m) ~ ; 

thus the former values are seen to be closely approximate for large samples. 

To indicate the dependence of a and /3, we will find their correlation coefficient, 
at the same time reminding the reader that while the regression curve of a on /S 
is linear, that of /3 on a may be very far from being so. It is indeed so complicated 
as to be unmanageable. 

Proceeding as before, we find 

S(.a0) s_ wi, er,*l-/»» 

-~N' ^***’>’ 


leading to 


= - 


7 


rax 

o-x 


1 + ^^- 


2 


.(xxxii). 


CTi- M 

♦ Diometrikat Vol. ix. p. 9. See also Phil. Trann. (1898), Vol. 191 A, p. 246. 
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The correspondinif values found for large samples in 1913* 

mi 


^ 1 ’-p^ 

CTi trx M 


and 


<ri 



were 


respectively. These approximations are now justified by the determination of the 
accurate values for samples of any size. 


One more theoretical point may be noted. If y* be the mean of the x array 
of y*s, then 

a fix, 

and the equation y* — ^ (^ — ^9) a? is exact. 

Hence we have 


*** + x^a^ -f- 2r«^ 

“ JfeT- 3 3f) ~ ^*3 (1 - p*)j , 


«r,»(l-p») 


v-^ ~ p > f 

<^9.- —M-r- i 


mi® 2 a: mj a?® 


1 ^ 

1 H s — 

cTi* <ri 0-1 


<ri* 


--1 

Mj' 


Finally 




o-3*(l — /»*) /, . (r — mi)* 2\ 
M-s + ■ o-jS— j/; 


(xxxiii). 


This gives the exact standard error of the mean of an array as found from a 
regression line. The value for large samples found in 1913 was 

3 j <r3*(l-p *) /, (r - mi)* \ 

M 0 - 1 * )• 

which is accordingly justified. 

We note one further point. Formula (xxxii) may be written 

1 


= — 




.(xxxii*^), 


and accordingly we see that if the coefficient of variation of the x variate be very 
small the relation of a to fi tends to become perfect (negatively); but if this 
coefficient of variation be large, the correlation tends to be small. Since coefficients 
of variation are in practice usually small, we see that the relation of a and fi is 
close, and the regression curve of fi on a will tend to be linear like that of a on fi. 

Starting with the variables a, fi, 7 , we have seen that 7 is distributed according 
to a frequency curve which is absolutely independent of a and fi, but that the 
latter are given by a bivariate frequency surface, exhibiting a high degree of 


Biomstrika, Vol. ix. p. 10. 
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correlation. We propose to replace a, 7 by three new variates which are mutually 
independent. These are as before 


+ and c* 

aj.7^ a\%’ therefore F=(l-p*)/Q, 

and finally 

- 1 7* 1 

2sa2“2 cra*(l-/3^) 


CTl* 


(1 -/>*)! 


> 


(xxxiv). 


As to the limits they are independent. Those of Z depend only on 7 and 
accordingly are from 0 to + oo . Those of Y depend only on / 8 , but it will have 
equal values when 0 — ^ takes plus or minus equal numerical values. Hence 
integrals for Y must be doubled, its limits are F= 1 , for 0^0y and F = 0, for 
0 - 0 '=- ± 00. 


Turning to X we may write it, having regard to (xxiii), in the form 

X = y ~ ^ ~ rrix) ^ y — ma ~ ( ac ~ mi) 

<r*.^ 


Now y-m^- 0{x- m^) « - (y* - y) + ^ ~ a?) + y* — Wa - (a? ~ mi), 

where y* is the mean of the sample array we are considering, and accordingly 


Again a (p— ^ 5 ) . ^ (S— aH-x) — S (JB—mi) > 

since ma and mi do not vary for the sample from the particular population, which 

has given the regression coefficient 0 and the mean x and p. Accordingly we have 

Let us consider the geometrical meaning of A. S (mi, ma) is the mean of any 
one of the k populations, and T(x, y) of the sample from this population with 
regression coefficient 0 (see figure, p. 239). AB is the regression line of this 
sample ; CD is a parallel line through 8 of which the equation will be 

y — m^ — (a? — mi) =* 0 . 

ON = X gives the position of the array of y's for this sample of which the mean is 
yx at P. Now 

PQ « PN — QN = ya? — {ma 4- (a? — mi)} — y* — ma — (^p — mi). 


Thus we have 


X 


PQ 

standard deviation of PQ 


(xxxiv^). 


But by hypothesis the arrays are to form normal distributions and accordingly 
fixing any particular sample by 0y x, y the limits of X are from — 00 to 4 - oo . 
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We now proceed to transform our equation to the three-way surface from a, /S, 7 
to X, F, Z, 

In order to transform the integral we must replace dad0 by 


dXdY 


da 

da 

dX* 

dY 

d/3 

d/3 

dX^ 

dY 


» -jr » 17- dfx d^ 

dXTY* 


since y 8 is a function of Y only. 

Now a = <r 2 VQ/ArX-hma — i^mi, where both Q and ^ can be expressed as a 
function of F only. Accordingly 


a%\/QIM (xxxv). 


Further since 


/9-^*±cVl/F-l, 

di3 

dY" 

where we may drop the double sign on the understanding that we have to double 
the result if we integrate out for F. We will now write down the transformation 
of (xxvi) remembering that 1/0*= F/(l — p*), and that 


T icF-t(i - Y)-i (xxxvi), 


dad/3 pT-* (1 - F)"* dXdF. 

JlTa.d- p*)iW-» e-i^ F y 

2*<"-»7r<r,*r(i(Jf-l)) VM P U-pV 




X r-*(l - y)-*dF4=^ P*2*(Af-« e-t dZ, 

' y/2 
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or 

zdXdYdZ - - e-t (i _ y)-i dXdYdZ 

2 V27r r (J (if - 1 )) * ' 

4(^-lXi) ^r(i(if-2))‘*'^ 

(xxxvii). 

Thus our three-way surface in a, y hew been reduced to the product of three 
frequency curves independent of each other ; namely a normal curve in X, a Type I 
curve in Y and a Type III curve in Z, 

Let us now consider how we may proceed. If we want to know whether on the 
k observed series we may suppose the array standard deviations are constant we 
calculate k Z*b, and ask if these Z*s are a random selection from the Z curve. To 
do this we calculate their probability integrals by means of the Tables of the 
Incomplete V-Functiony and then find the value of by aid of the same tables*. 

If we wish to know whether the k series have the same regression coefficient, 
we calculate the values of Y (or, if we prefer, for these k series, and by aid of 
the Tables of the Incomplete ^-Function find the values of the k corresponding 
probability integrals, then by the Tables of the Incomplete V-Function again find 
Pa„. This may be done without first supposing the A:-array standard deviations to 
be the same. If we suppose both the array standard deviations and the regression 
coefficients to be the same, then we have 2k probability integrals, and we can 
determine Pa^ for the 2k as one set, instead of two sets of k. 

Lastly if we want the equations to the regression lines to be the same we have 
to deal with Xy which first introduces and ask whether the k values of X form 
a random sample from the X normal curve. We calculate the probability integrals 
of our k values of X from the normal curve probability integral table f and we 
thus have three sets of k probability integrals, or one set of 3A?, to test by the Pa^ 
method. The advantage of this method is that in each of the tests of k X*b, Y*b 
and Z*B we are working with the actual curve of distribution, and not with an 
empirical curve fitted by two moments. 

Now I have said above that we must compute the observed values of the X^s, 
Y*s and Z's, but this is not possible unless we have a knowledge of the constants 
of the k parent populations. These must be obtained in the customary manner by 
choosing them so as to make the probability of the observed results a maximum. 
I have expressed the opinion that on the assumptions of common array standard 
deviations and normal array distributions we lose little, if anything, in generality 
by assuming the k populations to be all normal distributions. The determination 
of the constancy of the array standard deviations indicates that we must maximise 
the product of the k Type III values to find 8% or o*# fl — This constaqt is 

not to vary for the k populations. If we now desire further that the regression 
coefficients should be constant for the k populations, we may do it in two ways : 
(a) By supposing each population has a different in which case the c<* are all 
different. But if we have fixed the array standard deviation and the regression 
* Or by aid of Miss F. N. David’s Table, B%€nnetrikat Vol. zzvi. pp. 7 — 11. 
f Tables for Statisticians and Biometricians y Part I. pp. 2 — 8. 


* N e-^^ dX X 


if( 
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coefficient of *all siirfEtces as the same, this is only rendered possible by 
assuming that for our k populations, oc 1/(1 This I have termed the 
relation (v), and find difficult of acceptance : see p. 228. Turning to the other 
alternative: (6) By supposing Pi 1*^ have the same value for all the popula- 

tions; then the fixing of demands that shall be the same for all the 
populations, and we have fixed. In this latter case, since 0 * 1 , 0*2 and p are fixed 
for all populations it follows that our k populations may be treated as described 
by a single surface movM parallel to itself, but having a different mean for each 
population. If we accept (a) then 0'*2,<(1 — p^) *= constant =» 0 * 2 ,* 
accordingly there is a hyperbolic relation between the two standard deviations — 
or a linear one between their variances — ae singular and remarkable a relation 
among k populations as (v). 

When we turn to the JT distribution and consider the most likely values for 
17^2,4 and mg ^4 defining the a; and y means of the tth population, we find, if there be 
only one array value sampled from that population, that the most likely values 
^i ni ^4 are not sejiarately determinable, but that the best position for 
them is on the observed regression line of the tth population sample. This leads 
to aa 0, or the probability integral =* *6 for all k samples. Accordingly we reach 

-r (-693,1472 A;- 1). For example: Pa 4 « ‘3020, Px« « -0489, « 0093. 

Thus we require the number k of populations to be large before it indicates an 
improbable series of means when the mean of eeush population is placed in its most 
likely position, namely somewhere on the observed regression line of the sample. 
With a small number of populations, such as four, no discrimination against the 
hypothesis that they have independent means is likely to be provided by the Jl 
distribution. If we suppose that all the k populations may be represented by 
normal surfaces having the same 01 , 02 and p, but differing means, we get no 
adequate test from the distribution of A, but if we suppose them to have the same 
mean, i.e. all the k samples to be drawn from the same normal surface, or possibly 
each sample from a different region of the same surface *, then we shall obtain a 
test from the X curve. We can again obtain a test from the JC curve, if we break 
each z sample up into two or more subsamples, for in this case we have equations 
enough to determine and m^ 4 . If we have s subsamples in the ith sample 
of the tth population, and we have already fixed c*, and the array variability, we 
shall have in order to determine 04 and 1712^4 to minimise the expression 

S — Uj 4- 7n2 4 {04,4 — 


with regard to and which leads to the equations 






8 


C* + Wu, - C* + (/S,., - /3)* <=1 c* + - /S)* 


Sf s 






n,t 




i=.l c* 


.(xxxviii). 


Biomeferika xxvn 


This is the condition (6) of p. 228. 


16 
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On the Wddh-Kolodziejczyk Frequency Surface 


Since ^ x clearly have equations to determine and 

the means of the tth parent population. If there be only one array for the sample 

the sums reduce to a sinsle term, the factor =— divides out of the first 

fi /Q 

equation and the factor ~ - from the second, and both reduce to 


which is m^i — (^i.t is equivalent to saying that the most probable 

values of and lie on the regression line of the sample, but it does not 
further define their position. 

There are innumerable hypotheses we can make with regard to the constants 
Pi * surfaces. The method of testing them is the same 

in all cases, namely to deduce the most likely values of these constants by the 
usual method of maximising the total probability of the observed event, subject 
to the conditions imposed on the 5k constants, and then apply the Pa„ test of 
reasonableness to the k values of X, Y and Z, 

1 will now proceed to illustrate these methods on the data provided by 
Mr Wilsdon of the Building Research Station, which were used by Mr Welch 
in his paper, and which Mr W^ilsdon has most kindly permitted me also to use. 
The raw data will be given here as we need to table more constants than are 
published by Mr Welch. We agree absolutely with Mr Welch’s values of and 
have adopted his values of the array variances ; all the other constants except the 
latter have been worked out by Miss David and myself. The populations each 
represent eight tests of the crushing strengths of concrete and mortar in lbs. per 
sq. inch ; the individual results represent different cements used in making the 
mortar and the concrete, the same cement being used for both. The four groups 
1, 2, 8, 4) represent tests made when the age of mortar and concrete was one 
day, three days, seven days and twenty-eight days respectively. The regression 
line is for concrete (y), on mortar (x). 


Orudiing Strength 

{i = l) 

Crushing Strength 
(1 = 2) 

Crashing Strength 
« = 8) 

Crushing Strength 
(i=4) 

Normal 

Dry 

Normal 

Dry 

Normal 

Dry 

Normal 

Dry 

Conorete 

Mortar 

Conorete 

Mortar 

Conorete 

Mortar 

Conorete 

Mortar 

888 


2608 

6847 



5568 

8947 

1738 

4032 

4455 

8165 

5678 

9358 

7298 

10658 

256 

1368 

1560 

3242 

3189 

4533 

5568 

6318 

1914 

4772 

4199 

7716 

5362 

8927 

7550 

11000 

724 

2334 

2363 

4477 

3448 

5462 

5013 

7165 


5680 

4667 

7606 

5765 

7970 

6963 

8972 


2304 

2200 

4783 

3335 

6314 

5860 

8458 


4021 

3476 

6809 

4492 

8256 

6138 

9548 
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From this table the following constants were deduced : 


Gonstant 

Sample (t==l) 

Sample (i~2) 

Sample (<3s8) 

Sample (i»4) 

h 

<f’hi 

U 

A 

1280 

3506-376 

644*5122 

1448*3196 

*986,649 

*43907 

-269*0895 

3323 

3191 

6067*876 

1092*0896 

1666*1133 

*965,638 

*63677 

-672*8414 

9541 

4438*876 

7288*376 

994*9920 

1693*9983 

•892,629 

*65719 

4-377*873 

7920 

i 

6244*375 
8883*25 
860*6380 
1491*6566 
*837,205 
•48807 
+ 1953*1193 
5926 


As we have seen there are five independent constants for each population on 
the assumption that it may be treated as a normal population, namely, 

Pi* We may replace these by five equally independent constants, namely, 

7<* =»••*.< (1 - "h.*. 

and 

Hence the distribution curve of y/s in the ith population is from (xxv) 

VF - 5 hi rj 1 

( 7* ) 

where the constants will not occur in any other of the three independent 
distributions. Thus in determining the most likely value of we need only 
consider this the Z distribution. Now the hypothesis we are to test is whether 
the array standard deviation is the same for all k populations, or = 7 a quantity 
independent of i. Accordingly we must make 

, ^ . . - f f (?)■ p™d«ct 

a maximum with regard to 7. 

Taking logarithmic differentials we have 

f 1 T 1 



Hence the most likely value of the array standard deviation is given by 


(1 - f>i*) = 


Ar~2 


(mean value of 7<*), 


and the quantity we have called by 


— 


if -2 


(mean value of 7^*) (xl). 
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On the Welch- KolodmeQcssyk Frequency Surface 


Here we are met by the old difficulty to which we have referred, namely, we 
have to suppose that 




.(V). 




We prefer to take as our hypothesis that the k populations are all normal, and 
having the same 0*9 and p. This justifies the form we have assumed for the 
distribution of 7 ^ above, mid we shall now show that the h 3 ^pothesis is not con- 
tradicted by the Mortar-Cement data. We may also remark that there seems no 
greater reason in the present ctuse for supposing that the crushing strength of 
normal concrete (y) on dry mortar (x) is given by a linear regression, than vice- 
versa that of X on y. Thus we may propound an additional argument in favour 
of the populations being normal, if any were required beyond the artificiality of 
assuming the truth of (v). 

In Mr Welch’s illustration A? = 4, and we have from the four tests the four 
array variances of : 

7 i» « 3323, 7 a* = 9541, 73 * = 7920, 74* * 5926, 

giving a mean of 7 * = 6677*5 and by (xl) «a* equal to ^ of this value since each 
test consisted of Jlf ** 8 individual cases. Accordingly Sa* = 1112 ' 9 l 6 , 

We have now before us the following problem : The most suitable value of 8% 
being the root of 1112*9l6, for it gives the highest probability of the observed 
result, may the values observed of 7 be taken as a reasonably random sample from 
the curve of distribution 


? 


_ 1) 

I know no better way of solving this problem than by the method, and we 
will proceed to calculate the probability integrals of the four 7 ’ 8 . Write 

-KS)'. 

and we find that the probability integral becomes 


Pi' 


1 frA* 

dvi. 


Hence in the notation of the Tables of the Incomplete T -Function^, p ■ 
and this equals 2 , so that 

1 




Ci*) vjrt-i ’ (2 vV 


Thus all we have to do is to calculate the four values of 
into the above Tables. The values of 


.(xli). 


I 7 

— _ £± interpolate 

2 V3 «,* 


, » .2= = ■/«* 

V3 2226-8333 


X -677,3503 


PabliBhed by the Biometrika Office, UniverBity College, London. 
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are as follows : 


til « *861,94101, ti 9 - 2*474,8081, u, - 2-064,3382, and ti 4 «= 1-687,1222. 

From these by merely linear interpolation into the F-function table which will 
suffice practically we have, if p' «« 1 — p, 

Pi - *189,6896, log Pi - 1*278,0485, pi' « 810,8104, 
pi « *800,7516, log pt 1*908.4977, p,' « *199,2485, 
p* - *689,4926, log p# - I*838.5297, p#' « *310,6074, 

P4 « *496,8960, logp 4 « 1*696,2666, p 4 ' « *608,1040, 

Hence 

as log PiP%PzPa ■» — 1*288,6686, *= log piptp^p^! =* — 1*598,2286, 

and , since n = 4, gives us 

vw logios 

^ . = l-477.8726 and 


log pi'- 1-908, 6614, 
logp,' = 1-299,3952, 
logp,' = 1-492.0720, 
log p« -1-701,6678. 


1-840,0228, 


-868,6890 * "■ ' -oea.sooo 

leading to P*,= J (1-477,8726, 3). and P*,. = / (1 -840,0228. 8), 

or = -34288, P^. =. -60149, 

Q*,-1-Pa„- -66712, -49861. 

Thus whether we reckon our probability integrals from the left or right end of 
our frequency curve the same conclusion must be drawn, that the four array 
standard deviations form a most reasonable sample from a set of normal populations 
having a common array standard deviation 

VI -p»- 2^2 (mean value of 7 ^*)^ = V8903*3333 

■■ 94*3575, with standard error 23*1365, 




as against the observed values, 57*6456, 97*6780, 88*9944 and 76*9805. The only 
deviation claiming special consideration is the 6 r 8 t, which is about 1*59 times the 
standard error, and a deviation as small as or smaller than this would occur in 
about 19 in 100 trials*, or about *76 times in four trials ; it occurs once. Mr Welch 
gives a far higher improbability, = *0126, for the four values being a random sample 
from his curve of distribution of the 7 * 8 . I think the difference arises from our 
methods. I have given, on the assumption that the surfaces are normal, what I 
think is the real curve of distribution of the 7*8 from which a little sample of four 
has been taken. Mr Welch obtains a compound criterion and selects a curve of my 
Type I from the first two moments to describe the variation of his criterion. While 
I believe for considerable material this method might give adequate results, I have 
grave doubts whether, if there really exist only four observed values, it may not 
possibly be misleading. At any rate, if my arithmetic be correct, I cannot raise 
objection to these observed values being a random sample from four normal 
frequency surfiEuses giving exact curves of distribution, and I will start on the 
assumption that they do. 


See px ebore. 
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On the Welch-Kotodziejezyk Frequency Stirface 


We now pass to the question whether if ^ p^he treated as a constant 

for the arrays of all the k populations, the values of «■ pi<r%,il<r-^,i can be considered 
as a random sample from a series Hi, 112 , ••• Hi. of normal populations having the 
same but not necessarily the same Wi and i.e. the a/s and ^/s are not 

necessarily the same. In order to get rid of a,- we must start from the distribution 
of iZ, or in each of the individual populations. This will be of the form (xxi), or 




.(xlii). 


Here the quantity cr\iO-~~p^) has already been supposed constant = 7 for all 


populations. Our hypothesis to be tested is now that ^ is constant 

(I - p r) _ 7 


: for all 


populations. Putting y -= J ^ can suppose either (a) that 

^ i,i ^ 

varies from population to population, or (6) that = c* = a constant for all popula- 
tions. Let us see whether one or other of these hypotheses is the more probable. 
We may write (xlii) in the form 


ijM) 


r (i (j»f - 1 )) {ct‘ + (/8, - 


.(xlii**). 


(a) Since c< only occurs in the equation to determine its most likely value is 

JU-1 ... 1 


iM 


2c,* Ct‘ + (/3t-j8)» 

and for /3 occurring in all the k populations 

<=oc,* + (/S,-/3)» 

which by (xliii) gives, for the determination of )3, 




.(xliii). 


.(xliv). 


t=* 1 

S — 

^ 


0 


(xlv). 


Assume /So an approximate value of and let = i9o + where e is supposed 
small, then we find 


i^k 1 ji^k I 

S ^ / 8 

i=0 fff — »*=0 (fit — ^ 0 )* 


(xlvi). 


A good taking-off point is to assume l3o to be the mean value of the k fit's. This 
in our example is *529,02^ and substituting the values of fit in (xlvi) we obtain 
emm ^ *006,126. Thus /3 = /S© + € » *622,899, not a very great change on fio, and for 
practical pui*po8es needing no further approximation. We have now to consider 
the values taken by 

Y « 

' c^^(fii-fif' 
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but this equals in every case (Jlf — 1)/Af. Accordingly we have, since for i « 1 and 4, 
— /3 is negative, and for t « 2 and 4, ^ is positive, 

pi = />4 =* (3*5, J), 

and =* pa = 1 — J (3*5, J), 


as the probability integrals derived from the Type III curve 


ImiPt ?) being the incomplete Beta function. Looking up these values in the 
Table of that function, we find for (Jlf — *875, 


It follows that 
and 

Accordingly 

and 


as p 4 SB ‘175,3968, and pg » ps xs ‘824,6032, 
\n « log (pipap«p 4 ) = ~ 1*679,4666 

- 7 =^ , since n = 4. -1-932.176. 

va logioC 

Pa„ = / (1-932,176, 3) = *53949 
-46051. 


There is no improbability therefore in supposing each of our four sets drawn from 
populations having the same regression coefficient, but different values of the 
quantity = o-g* (1 — p*)/<7*i,,. 

It may be asked why we have not included the and terms in X, when 
maximising the probability of the total observed data. The answer is that the 
maximising of X with regard to ^ and causes the vanishing of the fistotor 
{a^ — ai + — )8)}* in X and so the other factor containing and ^ disappears 

with it (see our p. 241). 


(5) Now on the other hypothesis we have for the four c<**s 


Cl* = 066,643, ca* *081,263, cg* « *005,663, and cg* - *014,783, 


and these c^*’s do not at first sight look like the result of random sampling from 
a common value. But if they were, this would mean that <ri,^ would be the same 
for all four populations. The 0 * 1 , « values are given in the Table on p. 243, and 
d priori they do not look so improbable as a selection of four values of a standard 
deviation from four bivariate normal surfetces with the same 0 * 1 , that is to say four 
random values taken from the frequency curve 


N-/M 







This leads us to the most likely value for o*i*, namely. 


M 






M 

jyr-i 


(mean value of observed x variances) 


* 1656*6673 for our data, giving ax « 407-0218. 
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O'n the Weieh-Kolodan^czyk Frequency Surface 


Taking , 

Hence we obtain j>i « *478,364, px * *626,686, 


which give us 
and 


jt>2 = *666,666, 
JOB = *611,974, 
= ‘516,390, 
- 1*002,028, 
Px^ = *2013, 
Qa,='7987, 


p^ = *333,335, 
p^ = *388,026, 
p/ = *484,610, 

V * - 1*481,366. 
P;^,==:-4439, 

- 6661. 


Thus, according to the Pa„ criterion, there is no improbability in the four values 
of ax^i being random samples from parent populations having a common 
<7i - 1666*6673. 

Returning, however, to our different c^^'s and accordingly our different 
have to find for those s and the corresponding ^ (— *622,899), whether the result- 
ing four F/s given by 

F - 

provide an improbable set of probability integrals. We have just seen (p. 247) 
that they give 

“ 5395, = <2*„- - -4606. 


Comparing the latter with — ‘6661 as the more stringent series above, we see 
that to suppose the four surfaces to have different crx.tS is a somewhat more probable 
hypothesis than supposing them to have a common ax, but the probability of the 
latter is so considerable that it certainly would not prevent us from accepting the 
hypothesis that all four populations have the same ax, a 2 and p. 

It may be objected, however, that for the proper determination of ax we ought 
to have taken into consideration the general effect of fixing c and /3, and that we 
ought in determining ax to proceed by maximising first the combined probability 
with regard to o and /3. This we will now proceed to do. The curve for the Y 
variate for a single population is, if as before, c* = <T 2®(1 — p®)/<ri* and /9 =» pa^jax, 


v;;r(i(ir- 1)) {^+ - /§)•}*" 


(xlviii), 


and accordingly the probability to be maximiaed ia 

\r a (m - 1 ))P I {c* + (/8, - /3)»l*^ ’ 
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hence differentaating with regard to c and j8 we find 

c 1 {cr 

1 

M c* 


or 


1 * 


(A-/9)*r 

1 


and 


k 1 {^ + 03, -^)*} 


.(xlix), 


g( 

, Vc* + (A-/9)V 


.( 1 ). 


These are the equations from which c* and ^ are to be found. Clearly if M be 
large md /3i not very different from that is a-fi small (see p. 246), we can start 
with 0 « mean as a first approximation. 

Again, if Jlf be_ large then (xlviii) approaches closely a normal curve, and the 
constants c* and can be found directly without approximation*. When M is 
veiy small, as in the present case, we can only proceed by successive approximation. 
Let us suppose approximations oq* and /So have been found to and 13, and that 
+ and /9o4-f =iS, where f and f are small quantities. Neglecting their 

squares, our equations for finding f and f are from (xlii^) 


k 

S- 


i Co^ + (/3i-^oy 


k 

= (S 




k^-8 

M * 


Co' 




* 2co«(/3<-go) 

(Co* 


+ gg 
- floW* » 


(/9«-/3o) 


{c„» + (y8,-^o)*}* 
(A-/3o)*}* I {co*+(/S<-/3o)*}‘ 


(li). 


.(Hi). 


Co* + — ^o)* 

In our case, A? ~ 4, AT « 8, and the mean value of the four observed yS/s, i.e. 
/3i=.*43907, j9a = -63677, /98 = ’55719 and *48307, is 529025. We will start 
then with ^ * *53, which nearly corresponds to this value, and co* = *04, and 
compute our coefficients in (li) and (lii). We have 


(/3i^/3o)_ 

Co* + (A-/3o)* 

i {co* + (/9.~/8o)*}* 
S CO* 


k 

8 

1 

k 

8 


= - -2512,0918, 


= 9-5458,5438, 


Co* 


k 

?{^o* + (A-)So)*}* 


= 78-8686,5354, 


k 

8 


(A— ffo) 


= 3-5365,8033, k 


I {co*-l-(/9c-/3o)*l* 
M-1 


- 8-5823,6398, 




3*5. 


1 Co*+(A--^o)* 

Thus there result the numerical equations for f, f» 

- *2512,0918 » 69-3227, 9916f - 8*5823,6398 f, 

- -0365,8033 = ~ -6865,8912 f + 9*5458,5438 {*, 

leading to f = - 0041,3500, f « - *0041 ,2948, 

and thus to a second approximation 

/§ - ^0 + f = *53 - *004,135 = -526,866, 
c* « Co* 4- f * *04 - -004,129 - 036,871. 

* In the Atmalt of EugotUct, Vol. v. p. 406 et seq. I have dealt with a case of M large, but varying 
in magnitnde from one population to a second, replacing (xlviii) by a normal curve. 
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On the Welch- Kvlodziejczyk Frequency Surface 


It would be easy to improve these slightly by a further approximation, but 
they are adequate for our purpose. They show that the normal populations from 
which we suppose the samples ITi, 11*, ... 114 selected have a regression coefficient 
in common most probably of the value ‘525,866, which is strikingly near the value 
found for /8 on the hypothesis that the populations have independent cr^^’s — i.e. 
*522,899, Further these populations have, as we have seen, the same array variance 
cr** (1 — p*) ss 8903*3333 (see p. 245), but the value of 




_ (1 ~ _ 8903*3333 


Accordingly 


O'!* 

8903*3333 




•035,871. 


or o,- 498-201 *. 

Now this result is of considerable interest. It is of course not the standard 
deviation of the values of x considered in the experiments *, it is the most likely 
value of the standard deviation of x in the several normal surfaces with fixed o and 
^ from which the observations are supposed drawn. It seems to me that 0*1 has a 
most probable value, the moment we drop the unacceptable (v). But wo can now 
proceed further to find the probable values of and p. We have 

(Ti CTi* 

or <7*2 = 0-1* (c* 4- “ 248,204*21 ( 035,871 + *276,534) = 77640*2362, 

and cr, = 278*460,475. 

Accordingly 

p = /§ X (Ti/cr, = *525,865 x 498-201/278*460,475 = *94084 
gives the likely correlation. 

We have now ascertained the cti, < 7* and p of the most probable surfaces from 
which the regression lines have been ascertained. The next step is to consider 
whether /S*, jSs and are probable values to be drawn from the now known 
curve of distribution of This curve is given in (xlviii), where we can insert 
the probable values of c* and /3, The values^of are y&i — ^ — *086,795, 

^ = -f 110,905, - /3 = -h •031,335, ^34 - - 042,795. 

From these and *035,871, we have to calculate 


c» + (y9,-/3)« 


.(liii). 


giving vi= 173,663, w,- •265,839, t;,= -026,627, 048,676. 

By the above transformation of /3< to the proportional area of the distri- 
bution curve of /3 up to the value /Si from left to right, or the probability integral 

/>< = *{! 3-6)}, 

* The value of without any constraint on the four populations to have the same J§ and c was 
407*0218, which marks the considerable influence of the restraints (see p. 247). 
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where the plus sign corresponds to fii> 13 and the minus sign to /3i< 13* These 
values of the incomplete Beta function must be found from the Tables of the 
Incomplete Beta Function, using as sufficient for the present purpose linear inter- 
polation. But the Tables give /« (3’5, ^), so that we must write 


3-5) (8-5, i), 
or « 1 - (3-6, i), if 

= Ki-^(3-6,i), if/e,</9. 

We deduce accordingly the following values : 

p\ = (3*6, 0*6) ~ *13238, p\ ^ *867 62 

= 1 - (3-6, 0*6) == *91734, pi = *08266 

Pa - 1 - i/.v78.878 (3*5, 0*6) « *66182, pi = *33818 

P4 » (3*5, 0*6) » *28467, pi - *71643 

= - log (pj P 2 P 8 P 4 ) = ~ 1*640,7183, \I « - log {plplplpl) = - 1*760,6600. 
Dividing these values by Vti logto e = *868,6890, we have 

= 7(1*888,947, 3), = 7(2*027,036, 3), 

whence from the Tables of the Incomplete V-Function we deduce by linear inter- 
polation as adequate for our present purposes 


Pa„ = *6219, P;,^. = *6768, 

(2a„ = 1 - Pa, - *4781, <2^ = 1 = *4232. 


It is clear from these results that the four values of the regression coefficient 
provided might easily have been randomly selected from the distribution curve 
of samples of 8 drawn from normal bivariate surfaces differing only in their means. 
But we can hardly draw the inference that the distributions of the populations 
Til, Da, ITa, 114 accordingly normal, because the array standard deviations and 
the regression coefficients are capable of being considered random samples from the 
corresponding appropriate Type III and Type I curves. They, consisting of only 
four values each, may be shown to be equally random samples from normal curves 
or rectangles. But the method which, I think, would be legitimate when applied 
to larger samples can be illustrated on this example. 

We now turn back to formula (xxvii) for the bivariate surface of a and 
namely, 

N<ti (1 - P«)* W-») r {\M) -JM -1^5- 

V2w<r,*r(J(Jlf-l)) ® 

and e = + 

er% 0*2 

and the subscript % should be attached to all the letters when we wish to emphasise 
that we are dealing solely with the parent population 11^. 
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On the Wdch-Kolodzi^czyh Frequency Surface 


Now we have seen how to test the hypothesis, that the array standard deviations 
for the populations are the same, and further the additional hypothesis that their 
regression coefficients are all the same, i.e. that their regression lines are all parallel. 
To find the probable means of each population, we should have to maximise the 
only term in which and vn^i occur, namely, 

with regard to mi^i and mt^i. But these give 

fii (cLi — = 0 and Oi — m%i + ^iruu = 0. 

These, supposing to be finite, reduce to the one equation 

+ A = 0, 

and tell us that the position of the mean of the individual population is not fixed, 
but its most probable position is on the observational regression line of the tth 
sample. If, on the other hand, we suppose not a series of surfaces with the same 
afid p and variable means, but one surface with one mean Wi, rw*, then we 
have to minimise 

* ( «< - ■■ !? . « + = s {XifIM. 

1 Vt 1 


We will put this into another form. Let 5 = wa + iSmi, and 

(1 - p») + (A - ^)*| = <rx« + (/3, - ^)*] . 

Thus we have to minimise _ 

^ / (at - g + wti ))* \ 

iV c*+(/3i-j9)» } 


by differentiation with regard to a and mi. We at once obtain the equations 

1 -mj « 


k 

ss 


k 

as 


I c*+(/3t-/3)* 


St-S 

1 + ■■■*» c» + (/9(-/3)* 1 c*+(/3,-/3)* 


1 c* + (/9t-^)* 
* at(fft-/3) 


.(liv), 


to find a and »h, or, of course, nt, and mt. Actually we ought to substitute in the 
above equations the value of c and ^ found from difierentiating those quantities as 
they appear not only in T but in X also. But the resulting equations then involve 
four variables S, mi, c and ^ to be solved by approximation, and we will merely 
ask whether the likely values found for c and /3, when the populations have 
independent means, are compatible with a common mean. 

Collecting the data we require, we have 

c*- -036,871, /§ = -526,865, «ri-= 498201. 

/3i- -43907, A = -63677, _/3, = -66719, A- -48307. 

-- 086,796, A-i8 = + -110,906, A-/8 = + -081,326, A-jS =--042,796. 

09,- A* - 0076,3337, (A - » 0122,9992, (A -A* = 0009,8126, (A- A* -•0018,8141. 



Kabl Pkabson 


258 


Hence 

c* + (A-^)* 

1 

Pi-M 

c» + (A-i8)» 

Oi 
gf 

c* + (A-)8)* 

c*+(ft-i8)* 

g<(A-^_) 
c* + (/S«-/3)* 

Thus we obtain the summations, namely, 


(i=l) 

•0484,0437, 

(1=3) 

•0481.7092, 

«=8) 

•0368,5226, 

(f«4) 

•0377,0241, 

28039,154, 

20*769,413, 

27 186,881, 

26*623,502, 

- 1-9996,8337, 

+ 2*3023,2270, 

+ -8600,1581, 

- 1*1350,7327, 

- 2690896, 

-672*8414, 

+ 377-8726, 

+ 1963*1193, 

- 5969-2029, 

- 13967*7926, 

+ 10263-7170, 

+ 51803*6637, 

+ •1736,6247, 

+ *2663,3912, 

+ •0266,2686, 

-h 0486,7641, 

+ 61809696, 

- 1549*09803, 

+ 32119768, 

- 2216*93351. 


S — =- = 97 •457,450, 8 ^ g - ■ 

I c» + (/9.-/S)* ic*+(A-/8)* 


•0176,8187, 8 


8 


ai 


< + 42120-2853, 8 


Tc*+(A-/9)* ■ 1 c* + (/3,-/9)* 

and our equations for S and mi take the forms 

97*467,4505- *0176,8187 mi* 42120*2853| 
•0175,81875- •6041,0386?ni = - 2926 73690J 


1 

« - 2926*73690, 


. *5041,0386, 


.(Iv). 


Solving these equations, we find* 

a * 433*2416, mi = 6820*9316, mg == 5 + mi/§ * 3494*2658. 

If we examine the Table on p. 243 these values do not look unreasonable. But the 
problem is: Will they give reasonable values for Xil Accordingly we proceed to 
compute Xi by insertion of 5, m^, c, ai in (xxxiv). We find 

Ai*-32*6340, X,*-ll*912l, Aa*4-3*7560, A* = -h 37*1566. 

It is idle to work out for four such values supposed to be selected at 
random from a normal curve of standard deviation unity I The probability is 
enormously against them. 

We conclude confidently that the four sets although they might be samples 
from a single shifted normal population could not possibly be drawn from the same 
normal population fixed in position. Thus (5) of p. 228 is absolutely excluded in 
this case. 


* I have endeavoured in (he course of (his paper to give some evidence of (he na(ure and amoun( of 
ari(hme(ioal work required, as this seems (o me an essential need for judgment of the value of any 
statistical process. 
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On the Welch- Kolodziejczyk Frequency Surface 


We may now proceed to further hypotheses. We may ask, admitting that the 
four samples could have been drawn from the same normal surface with the values 
of cTi, c and ^ determined on pp. 245 and 249, shifted parallel to itself, whether 
the means of these surfaces mi,i, ni^^i could be reasonably supposed to lie on one 

and the same straight line 

^ y^L-\^Kw. 

We have first to determine the most probable value of L and K by minimising 


1 X + 


.(Ivi), 


where c, /9 and cri have the values already determined. 
This may be written 


The problem now is: What value shall we give to 

On pp. 147 — 8 of his memoir, Mr Welch appears to give the x mean of the 
ith population the value of the sample mean x^, I am a little doubtful if this is 
admissible. He writes : 


“ It will be necessary to express algebraically the condition that the population means of the 
groiii)s [niy and mt,i\ are collinear. If Xt and yt are the means for the n< observations in 
the <th group and the symbol [ ] is used to denote expected or population value, then 

2 , ... k) 

and the condition that the point {x^y [y^]) lies on the line 

is that + + m (^ = 1, 2, 

In my notation [y^] = ^ priori = x^y my Wc have the 

condition that 

rn^^i «* i + 

Further, by our (iii), 

and, if we assume all possible samples taken from the parent population, this 
leads to 

= + 

and not to = «< -f 

unless we assume that 

Now the distribution of x^ and is given by 

f (Ivii), 

where Zq does not contain or Hence if we maximise the probability of x^ 

and y< occurring — the parent-population means and being unknown and 
not linked together — we’ find the most likely values of and are those of 
the sample, i.e. and y<; but if we introduce a bond between and this is 
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not the case. For example, if we make ■■ 
above, and then minimise 

(^i -«»!.«)* 2p - mi,i) iVi-L- 






= L -f Kmi^i substituting this in (Ivii) 


. (yi-L-^KmuiY 


2,i 


with regard to this leads to 






Accordingly will not be equal to unless either the slope of the proposed line 
of population means is equal to the slope of the ith population regression line (not 
the regression line of the sample), or the line of population means actually passes 
through the mean of the tth sample. As these are very special cases, we may 
assume that in general most likely value when it is linked to is not 

It is quite true that: (a) the k populations may not be normal populations, 
although if we feel unable to accept (v) they narrow down to something very 
close to it, owing to the form (i) of the population surface selected ; and (b) the 
sample may not be a random sample from the tth population — the a?-values may 
be directly selected; still must correspond to one of the possible samples 

from the tth population*, and we think (Ivii^*) will give a more likely value for 
than taking rrii^i equal to 
Now we have seen (p. 252) that 

' {c<* + ” ^<)*} 

= s ~ ~ ^<)l* 

* {c<* + (^i - A)*} 

Or, substituting from (Ivii) for the most probable value of nii^i — we have, since 


1 










s (X*)^8 ^ (y« - ^ \C,» + (K- 8J) 

or, as we may write it, 

^ I J^Cj^ + (K ^ ff,) (13, ^ ff ,)}« 

1 {c,* 4- (0, m [c,^ + (iT - 

(Iviii). 

The most general problem would be one in which we should maximise both 
the Xi and F< functions for L and Jf. But this leads to extremely compli- 

cated equations, and accordingly we will suppose C| and /S< are the constants 
determined on pp. 245 and 249, which give reasonable normal surfaces for the 


* I think in the actual example choaen, the values of x are more or less a random selection, and 
accordingly Xi , pi have random values. But if they are not, and it has been suggested that they might 
be taken at one end of the surface, is there any logic In assuming tliat a single probability integral (Welch) 
of a combination of such integrals, which are interpreted as random measures, will really provide a 
measure of the possibility of an hypothesis, and not of the selected values of :r? 
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k populations. What we have then to do is to minimise 8(X^) with regard to L 
and K — Accordingly we start from 

0 / y *x _ ^ o (y< -paij-L- KXjf {c* + «c (ff - 8i)\* 

1 * 7* 1 (c» + *»)*{c* + (/8-/3*)*l 

The differentiation with regard to L gives us, after dividing out the terms in- 
dependent of i, _ _ 

S ~ Kx^ (c* + < (8i ^)1* ^ Q (lix) 

1 


and for k, after again dividing out terms independently of t, 

1 ■” “ _ >+(/9,-|8)* 

^ jo* + ^ ( ffi - 

1 _ c* + (/3,-/3)* 

_ 2« * (g< -8^i-L- Kiotf (c* + tc (8t - 8)] * ^ Q 

v' + ^i c» + (/3,-j8)* 


(lx). 


We will now assume that approximations to L and tc or K are known, and will 
take L = Lo + ^, «=«o+?j thus neglecting squares and products, we have from 
(lix), 

s (Si-Lo- Kox,) { c« +_/ro(;9, -8)? ^ {c* + ^,(/3, -/3)}» 

1 ' c^ + {8i-8f ‘ 

. . f £ + _ * 2 (y, - jc^ -t- (/3, - )i (/3, - 

^Li c^ + {8i-'^iy ‘ c^+i8i-8f J 

(1«)- 


We have now to arrange (lx) in like manner — 

0 (y< — Aq — Kf,X() }c* + Aco (/8< - ^)1 ^ _ q --^0 ~ <)* {c* + «o - ^)1 (^< - 

1 c»-KyS, ^ 1 . _ c*-K/3,-/3)* 

2<icb ^ (y < -Lq- Ko ^t Y (c* -h <co {8 i - ^)1* 

■^c»+/ro®i ■ c* + (/y<-^)* 

= e [i ?l.(£“ + ^ (/3<-^ )r _ 2 s \< ^ + >c„(_8t-8)\ (fii-M) 

^Li c*+'(8t-'8y » c* + (/3,-/8)* 


+ « fc U-KqX,) [c» + iro(/3«-j3)r ] 

c*+*o* I c* + (/9,-/3)* J 

r * lc» + «, Ot - ^)1» * g, (y, - Ao - AoX,) {c« -H ACO j/3, - /§)) (8, - 8) 

Hi c*+(/8,-i8)» c* + (8i-Py 

* (y,-U-KoX,y(8i-ffy . 4 >ko ‘ a;, {y^-L,,- Kpx,) [o^ + (8i - fi)\* 

1 c* + (/3,-/3)* ■^c* + *o*t_ c* + (,8i-fiy 

_ 4 >IC 0 ^ (yt — Lq — AoX<)* (c* + ACq - 8)} 

c* + ACo* 1 c*+(/ 3 ,-^)* 


2 (c«-aco«) * iy^-Lo-K^x^)'{c* -Mro P)? 

(c* + ACo*)* 1 ca4.(^^_^)* 


(Ixii). 
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The problem is now how to find approximate values of Xq and «« JTo — /9, 
which may be inserted in the above equations Let us see whether Mr Welch’s 
assumption mi,i » Xg will serve as a starting point. We have from (Ivi) 


S(X,*) 

1 


^ Af («« - 

1 5*i.«(c«* + (/3«-A)*) • 


Putting = i + Xmi,i = Z + Kx^, ^t.i “ =• and Of = c, we have 


1 


Mc*Uv,-L-Kx,'f 
^ 1 c* + (A-/3)» • 


Difierentiating this result with regard to L and K we have the following relations 
to determine those quantities after dividing out quantities independent of i : 


* (y,-Z-igg,) ^ {y^-L-Kwpw, ^ 
1 c* + (/9,-/8)* ’ 1 c« + (^,-/3)* 

These lead to 


Jl. 


* 

.Z«S 


sr. + iTS-r 




ic* + (^,-j8)* ic* + (A-/8)* , o* + (/3,-/9)* 






■■L8- 


k 




1- (Ixiii). 


ic*+(/8,-/8)* 1 c» + (,8,-/3)* • ■■ 1 + 

delightfully simple equations after those we have found for a more probable value 
of 

Substituting the values given for x^, and on p. 243 and for c* and on 
p. 252 we find 


k 

8 


7 < 


1 c» + (/3,-/9)» 


: 381,806*66114, 8 


1 C* + (^4 - 8) 


- = 97-467,450, 






and 


- - 640,114-12926, 8 , 

1 c* + (;9,-/3)* 1 c* + (<S,-)3)« 

It 


- 45,8190,8819-95, 


-S 


«*y< 


: 28,5448,6571-56. 


.(Ixiv), 


Thus we obtain the equations 

381,806-66114- 97-467,450Z+ 640,1 14-12925/f) 

28,6448,6571 66 - 640,1 14-12925Z + 45,8190,8819-95iir j 
which provide the solutions 

Z-- 2114-2489 and JT- -9183,6118, 
and the most likely lines for the means 

y - -9183, 6118® - 2114-2489. 

We have now to consider what is the probability of the resulting set of Xt’a. 
We have 

® Vt — Z — JXSf 


X,. 


7 + 
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On the Welch- Kolodidejcxsyk Frequency Surface 


Here Jf= 8 . c* = *035,871, ^ = *625,866 aud 7 = 94*3576: 

see pp. 250 and 246. Thus 

Y - 005,67728 (y, + 2114 2489 - *9183,61 18gf) 
V*035,871 + (Mi - *526,686? 

whence we obtain 


JTi = + 4-7702, Xa « - 6*9129, iTa - - 4*1473, Z 4 = 5*8650. 

These values as a random sample from a normal curve are most highly improbable 
— we do not need to calculate their — and accordingly if we adopt the hypo- 
thesis it is clear that the means of the populations cannot lie on a straight 

line. Our problem, however, is whether the values thus found, i.e. 

Xo » - 2114*2489, ifo = *9183,6118, 
and accordingly /tq = JSo — /8 « *3924,9618, 

will be suitable approximate values for the solution. The actual arithmetical 
work is not so laborious as it appears, the summations often repeating themselves. 
We find (Ixi) gives 

- 41*8567,47095 = *2035,5520f + 1458*100,055?, 
and (Ixii) - 226,852*3567 = - 24767*384,440f + 850,2270*6475,7242?. 

As solution of these equations we find 

f = - *6633,5362, ?= - *0286,13755 (Ixv). 

These denote a very small change in X, and a fairly small change in K or k, so 
we shall not go to a second approximation. They give us 

X=- 2114*9122,5362, if = *8897,4742 and /c = *3638,8242, 


or the most likely line for the means of the four populations is 

y * *8897,4742^ - 2114*9122,5362 (Ixvi). 

We have now to discover whether this line gives a reasonable confirmation of 
the hypothesis that the four populations means really lie on it. For this purpose 
we must compute the four A/s from the formula 

y •JM c ^ -L - {c* + /t (;3, — /9 )} 

7 + 

= .na .^7 V (yi + 2114 91 225-*8897.4 742^, ) (•035,8 71 + *3 638,8242 

* ” ’ " V*036,8fi + ()8,-*BH865>* 

and then see if they are a random sample from a normal curve of standard deviation 
unity. The values of are 

Zi = + *191,657, Z, = - 1*089,254, Z* = + *673,000, Z. = + 1*606,296, 


giving 

pi = *576,9942, log p, - 1*760,4181, p,' = *424,0058, log p,' = 1*627,3719, 
p, = *138,0218, log p,- 1*139,9476, p,' = *861,9782, log p,' = 1*986,4963, 
p, = *716,6767, log p, = 1*856,3227, p*' = *283,8243, log ps' = 1*452,2838, 
P 4 = *945,8936, log p* - 1*976,8422, p/ = *064,1065, log p«' = §783,2494. 
X„ = - 1 268,4694, X„' = - 2*251,6986, 
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and hence 

= / (1-460,3793, 3) « -3361, Pj^. « / (2-592,2486, 3) » -7698, 
or Qa^»-6649, Qa»«-2402. 

Qam' is the more stringent, but we see that it is quite compatible with the means 
of the populations being co-linear. 

Summary, (a) The hypothesis that the four populations are normal and have 
a common array standard deviation can be accepted. This is opposed to Mr Welch’s 
conclusion that the latter must be rejected, and is curious considering that our 
assumption of normality is more stringent than his assumption (i). I attribute the 
difference, supposing the arithmetic of both of us to be correct, to his assuming 
for the distribution of his compound criterion a curve which is probably not the 
true curve. In my case the hypothesis of normality leads to the exact curve. 

(6) 1 find no improbability in the four normal populations having the same 
regression coefficient. This is in accordance with Mr Welch’s conclusion. This 
result can be expressed by saying that the four populations have the same values 
. for Vl -p\ ^ and 

(c) It is highly improbable that the value of is the same for all populations ; 
in particular that they are all selections from a single normal surface. 

(d) Mr Welch finds that the means of the four populations are unlikely to be 
collinear. I find that if I use the values he adopts for the ryix,/s — namely, to take 
them equal to the sample means — that I agree with him. But I doubt the accuracy 
of this equality, and by taking what I think is a more likely value of 1 conclude 
that the hypothesis of collinearity need by no means be rejected. 

(e) For such small samples as are here under discussion the conclusion that 
certain hypotheses are not markedly improbable does not seem to be a justification 
without further experience that they are true. My method has been to combine 
the probability integrals deduced from the k populations and determine whether 
they are a fairly probable random sample of k points from the appropriate curve 
of distribution. Mr Welch’s method is to form a combined criterion, and find the 
probability integral of this combined criterion on the assumption that it follows an 
empirical curve. In either case with merely four points, we should get equally 
probable results had we taken other arbitrary curves for my appropriate curves, 
or for Mr Welch’s empirical curve. It requires not four, but frequently one or two 
hundred points to make a selection markedly more probable when taken from one 
curve rather than from another. Hence it seems to me unjustifiable to adopt from 
four samples of eight individuals each any conclusion as to the algebraical forms of 
the populations sampled. 


17—2 
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(i) Constancy in the Number of Ugulate Flowers of Chrysanthemum 
laeucanthemum, variety Pinnatifidum, during the Flowering Season. 

By william DOWELL BATEN 
(University of Michigan). 

Many plants produce flowers at the beginning of the flowering season which differ significantly 
from those pi*oduoed at the end of the season. Yule* found that the average number of sepals 
in flowers of Anennone netnorosa differed significantly for two periods of the same season. 
Burkillt shows by his counts that the number of stamens in Stellaria media varied greatly 
during the blossoming period. S{)ecies of Salpiglossis produce rather large blossoms at the 
beginning of the season and very tiny cleistogamo\is blossoms at the close of the season The 
average numbers of pistils and stamens of Ranunculus ficaria differ widely during the time of 
blooming §. Shull’s || samples of Aster prenanthoidea Muhl. show that the number of bracts, the 
number of disk flowers and the number of ligulate flowers differ significantly for different times 
in the blossoming period. TowerlT found there was a significant difference between the average 
number of ray flowers per head of Chrpaanthenmm leucanthemum for different periods of the 
flowering season. 

The object of this study was to determine whether or not Chrysanthemum leucanthemum^ 
variety pinnatifidum, exhibits any significant differences in the average number of ray flowers 
per head at different times during the flowering season. 

Material. 

On May 17, 1932, a sample of one thousand flower heads of Chrysanthemum leucanthemum^ 
variety pinnatijidum^ was gathered at random from a plot of about one -half acre, located about 
four miles east of Dexter, Michigan, along the Huron River Drive. On this date there were 

* Yule, G. U. ** Variation of the number of sepals in Anemone memorosa,” Biometrika^ Vol. z. 
(1901--2), pp. 807-809. 

t Burkill, 1. H. ^*On some variations in the number of stamens and oorpels.’* Joumul of the 
Linnean Society^ Vol. xxxi. (1895), pp. 216 — 245. 

t Dale, E. E. Unpublished results of study made in the Botanical Gardens of the University of 
Michigan. 

g MacLeod, J. ^^Over do oorrelatie tussohen het aantal meeddraden en het aan|al stampers bij het 
Speenkruid {Ficaria ranunculoideay' BotanUch Jaarboek^ Jaargang xi. (1899), pp. 91 — 107. **Ooopera> 
tive investigations on plants. II. Variations and correlation in Leaser Celandine from divers localities.” 
Biometrika^ Vol. u. (1902 — 8), pp. 146 — 164, Miscellanea. Change in organic correlation of Ficaria 
ranunculoidest^^ Biometrika, Vol. i. (1901 — 2), pp. 125 — 128. 

II Bholl, G. H. *<A quantitative study of variation in the bracts, rays and disk florets of Aster 
ahortii Hook., A. Novae-AnyUtp L., A. puniceua L. uzd A, prenanthoides Muhl. from Yellow Springs, 
Ohio.” American Naturaliatf Vol. xxxvi. (1902), pp. 111 — 152. 

U Tower, W. L. Variation in the ray- flowers of Chryaantkemum leucanthemum L. at Yellow 
Springs, Green County, Ohio, with remarks upon the determination of Modes. ” Biometrika^ Vol. i. 
(1901^2), pp. 809—816. 
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maany beads in bill bloom^ with cximparativelj few heads having gone to seed. There were 
thousands of buds on the plants, showing that this sample was gathered at the beginning of the 
flowering season. 

On June 4, 1932, another sample of one thousand was taken at random fW>m this same plot. 

Another sample of one thousand heads was gathered from this plot on June 23, 1932, at which 
time there were scarcely any buds, while there were thousands of old heads in seed. A few 
days later there were not enough blooms for another sample. It seems plausible to assume that 
this sample was gathered at the very end of the flowering season for this particular plot. 

The plants were growing veiy close together in an open place well elevated and well 
drained. The area was completely covered with daisies with a scattering of uncultivated rye. 
There was an extraordinary abundance of the species growing here. 

The sample taken at the beginning of the season will be designated by Sample A, the one 
gathered in the middle of the season by Sample B, and the last by Sample C* 


Analysis of the Samples, 

The following table gives the frequency distributions of the number of ray flowers per head 
for the three samples : 


Number of ray 
flowers per head 

Sample A 
Frequency 

Sample B 
Frequency 

Sample C 
Frequency 

Sum of 
samples 
Frequency 

11—12 

0 

0 

3 

3 

13—14 

131 

168 

139 

428 

15—16 

242 

239 

241 

722 

17—18 

238 

263 

240 

741 

19—20 

229 

206 

232 

667 

21—22 

146 

117 

128 

391 

23 -^-24 

10 

7 

17 

34 

26—26 

3 

6 


9 

27—28 

0 

4 

0 1 

4 

29—30 

1 

0 

0 

1 

Totals 

1000 

1000 " 

1000 

3000 


Histograms in Fig. 1 show how the sample at the beginning of the flowering season 
compares with the sample at the close of the season. The general appearances of the two are the 
same while the ranges are nearly the same. Both rise abruptly for the class 13 — 14 ray flowers, 
have about the same heights for the three middle dasses and drop suddenly for the class 23 — 24 
ray flowers. 

The significance of the means is 

Difference of Means ^ 

Probable error of Diflerenoe of Means ’ 

which indicates that the diflerenoe of the two means is well within fluctuations due to random 
sampling. Vertical lines near the middle of Fig. 1 represent the means for the two distributions 

* The author is indebted to W. H. Long, a student in the Forestry Department of the University 
of Michigan, for the last two counts. 
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and show the small difference between them. The difference between the averages is not 
significant and is no doubt due to chance. The average number of ray flowers per head did not 
change appreciably from the beginning of the flowering season to the end. 

The following table gives the chief characteristics* of the distributions of the number of 
ray flowers per head for the three samples, and for the sample made up of the sum of A, B and C. 


Sample 

Mean 

Standard Deviation 

Skewness 

Median 

A 

17 *630 ray flowers 

2*64482 ray flowers 

•237819 

17*67 ray flowers 

B 

17*400 „ 

2*6844 „ 

*358392 

17*29 „ 

C 

17*622 „ 

2*62364 „ 

•108746 

17-48 

For sum 

17*6167 „ 

2*6062 „ 

•217802 

17*438 „ 


The means of the three samples differ by amounts which are less than three probable errors 
of the various differences of the means and hence are well within fluctuations due to random 
sampling. This fact clearly indicates the constancy in the average number of ray flowers per 
head. These means are represented by the vertical linos near the middle of Fig. 2 (p. 264). 

Slight differences between the medians of the three samples exhibit also the constancy of 
the numlier of ray flowers produced by this species. Their differences are within sampling 
fluctuations. The standard deviations, which show how closely the items of the distributions 
are scattered about the means, indicate here the similarity of the three distributions and the 
small differences in the probable errors of the means. Differences between the standard deviations 
are not significant and indicate that the samples were no doubt taken at random from the same 
parent population. 

The skewnesses for the distributions do not present any significant differences. 

Fig. 2 (p. 264) exhibits frequencies of the samples taken at the beginning of the flowering 
season, at the middle of the season and at the end. All three distributions contain few heads 
which had less than 13 ray flowers and a very small percentage which had more than 25. The 
bulk of all three lies within five classes which have frequencies differing slightly. The central 
graph differs slightly from the other two but only by differences which are duo to fluctuations 
in sampling from the same parent. 

The above study for this species in the ctomposite family clearly shows that the numl)er of 
ligulate flowers per flower head remained constant throughout the flowering season, and that 
different climatic conditions which existed during the season did not affect the average number 
of ray flowers per head. 


Analysis of entire Set of Ligulate Flowers. 

Since the first three characteristics of the three samples did not differ significantly, a large 
sample was formed by combining all three distributions. The chief characteristics of this sample 
for the entire season are given in the table on p. 266. Fig. 3 (p. 265) exhibits the histo- 
gram for this distnbution. 

* The corrections for these characteristics were made according to the formula for correcting the 
moments for discrete variates. AnnaU of Mathematical Statistics. February 1980, p. 111. 
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The frequency polygon in Fig. 4 (p. 266) shows that there are modes at 16 and 21 as 
Ludwig found in his sample of this species*. 


* Ludwig, F. **Ueb«r Ysriationskuryen und Variationtfiiicben der Pflansen.'* BotanUeht$ Central- 

hlatt, Bd. 64 (1695), B. 1—11. 
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(ii) Strene^h of Vision in relation to Elye and Hair Piflrmentation. 

By CATHERINE M. THOMPSON and ETHEL M. ELDERTON 
(Written up by K. PEARSON). 

Many years ago Francis Galt<)n raised the question* of whether it was not desirable to select 
candidates for the Civil Service not only by their intellectual powers and the capacity to satisfy 
a medical examiner, but, especially in the case of tropical appointments, by physical characters 
which do not enter into the customary medical inquiry. For example, was it more profitable to 
send young men of light or dark complexion to hot countries ? What hair 'and eye pigmentation 
were more suited to India or again to Central Africa ? 

Last year my colleague. Professor Fawcett, asked me with a like end in view whether I knew 
of any data associating vision with pigmentation. Besides much data not yet reduced taken in 
the Anthropometric I^Aboratory at University College, London, I had on loan and in course of 
reduction data from the Cambridge Anthropometric Laboratory. This latter material offered in 
the case of some 1500 young men, between 18 and 22 years of age, the moans of determining for 
one visual character, strength of sight, the relation to pigmentation. It did not, however, provide 
the effect of bright light of various intensities on modifying this relation of sight and pigmenta- 
tion, and for this reason could scarcely be of servnce for Professor Fawcett’s purpose. The results 

* See The Life^ Letter* and Laboura of Francis QaUon^ Vol. ii. pp. 886 — 396, especially p. 896, and 
Vol. 111 . p. 220. 
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appeared of sufficient interest in themselves to deserve publication in the Miteellansa of this 
Journal. The material was extracted from the Cambridge data and tabled by Miss Catherine 
M. Thompson, a student in the Department of Statistics in Univemity College, London. The 
constants were then worked out by Dr £. M. Eldertoii, and partly, under her supervision, by 
Miss Thompson. The Editor is solely responsible for the arrangement of the material and the 
oonoliisions drawn from it. He would state, however, that the Right Eye is known from other 
investigations to be more long-sighted than the Left. 

The visual character taken was the distance at which diamond type could be read without 
glasses. That distance was measured in inches. 


TABLE I. Strength of Vieion and Hair Tint, 


A. Bight Eye 

Number... 

Mean Distance ... 
Variability (s. d.) 

Standard Error of Mean 

Bed 

Fair 

Brown 

Dark 

Jet 

Black 

Total 

Population 

70 

54"*07 

16"*00 

1"*91 

279 

66 "-ie 

16"*36 

0"-92 

355 
66 " *80 
15"-32 
0"*81 

1 

784 

66"*42 

16 "- I 6 

0"-64 

14 

63"*67 
14"*22 
3 " *80 

1602 

55"-62 

16"*29 

0"-39 

B. Left Eye 
Number... 

Mean Distance ... 
Variability (s. D.) 

Standard Error of Mean 

1 69 

! r ) l "-92 
15"*60 
1"*88 

1 

I 277 
62"-68 
16"-31 
0"-92 

347*5 
54"*49 
14 " *09 
0"*76 

789*5 

6 . 3"*91 

14"*66 

0"*62 

14 

64 " *29 
14"*14 
3"*78 

1497 

53 " *73 
14"*71 
0"*38 


Examining the standard errors, wc see that no two eye-strengths are significantly different 
except in the case of the total populations, where the difierence of Left and Right Eye strengths 
— 1"*891 with a standard error of 0"*648, or the difierence is 3*46 times its standard error, which 
would only occur thrice in 10,000 trials, if Left and Right Eyes were samples of the same normal 
universe. It would l>e seen that the Right Eye for each hair-colour is stronger than the Left Eye 
except for Jet-Black Hairs, of which there were only fourteen. If we take the difference for the 
Right Eye between Fair and Brown, it is l''*608, with a standard error of 1"'227. Tlius the 
difference is only 1 *31 times its standard error, and would be found to be as great or greater in 
9*6 per cent, of trials. If we add all Right and Left Eyes together for Red and Fair, and again 
for Brown, Dark and Jet Black, we got: 

Mean distance Variance (tr^) 

Fair and Red Hair ... 53"*752 239-733,774 

Brown, Dark and Jet Black 54" *661 220*882,399 

Excess Dark Hair-»0"*899. 

standard Error of Difference- + g20^|gg9y 

\ o96 2304 / 

=:0"*6639. 

Ratio of Difference to Standard Errors 1*354. 

Thus, such an excess in Dark-Hair vision might be expected to occur in about 8*8 i>er cent, of 
trials, if no such excess really existed. This is hardly better than the previous test for Right Eye 
only. No stress can be laid on it. At the same time the general run of stronger sight is the 
same for both eyes, and we have some little evidence that Rod and Fair Hairs have weaker sight 
than Brown or Dark, although the differences are too small to be significant on the present 
numbers. 
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Table II gives the corresponding data for three eje-tints. There are three classifications : 
Kight Eyes, Left Eyes, and both eyes when the Left Eyes have had their strength increased up 
to Right' Eye strength by the mean difference. Here we naturally find again the same significant 
difference between the total populations of Bight Eyes and Left Eyes. If we consider the 
difference l)etween the strengths of Light and Dark Right Eyes, it is 4"*607 with the standard 
error 2" *033, or the difference is 2*27 times its standard error, which would only occur 12 times 
in 1000 trials, if the lioffc Eyes and Right Eyes were samples from the same normal universe. 
On the other hand, comparing Fair and Dark Left Eyes the difference is only 2''*948, with a 
standard error of 2" *454, or the difference is only 1*20 times its standard error, and would occur 
in 11*6 per cent, of samples if the Fair and Dark Left Eyes were members of the same universe. 
The fact is that the Cambridge category “Light Eyes” was either badly defined or poorly 
distinguished ; there are far too few cases in it. (Of course in more modem inquiries eye-scales 
of 12 to 16 tints are used.) 

TABLE II. Strength of Vision and Eye Tint, 


A. Right Eye 
Number... 

Mean Distance ... 

Variability (8.D.) 

Standard En’or of Moan 

Light 

Medium 

Dark 

Total 

Population 


992 

55”*84 

16”-24 

0"*48 

499 

66”*11 

15" *64 
0"*70 

1618 

65"*67 

16"*28 

0"*39 

B. Left Eye 

Number... 

Mean Distance ... 
Variability (s.n.) 

Standard Error of Mean 

27 

6e”*57 

12”*32 

2"*37 

986 
63”*77 
[ 16”*02 
0"*48 

502 

63"*63 

14"-26 

0"*64 

1514 

63"*77 

14"*73 

0"*38 

C. Bight Eye + Left reduced 
Number... 

Mean Distance .. . 
Variability (s. d.) 

Standard Error of Mean 

64 

69"*10 

11 ”*20 
1"*62 

1 1977 

66"*76 
16"*30 
0"*34 

1 1001 
65"*32 
16"*10 
0"*48 

3032 

55"*67 

16"'18 

0"*28 


It is at once obvious that there is no distinction in strength of vision between the categories 
of Medium and Dark Eyes. To thit>w more light on the subject the Left Eyes were modified to 
Right Eyes as explained above, and now from Class C the difference of strength between Fair 
and Dark Eyes is 3”*777, with a standard error of 1”*697, which gives the difference 2*36 times 
its standard error. Such a difference would occur only nine times in 1000 trials if Light and 
Dark Eyes belonged to the same visual universe, or, there is a very considerable degree of 
probability that Fair Eyes can see farther than Dark Eyes, i.e. the latter are more short-sighted. 

These results throw, of course, very little light on the problem of the complexion suitable to 
tropical countries. But it is of interest to have the superior vision of the Right Eye confirmed, 
and further to learn that Light-eyed persons are more far-sighted than Dark-eyed ; while on the 
other hand there is some evidence, but far from conclusive, that Dark-haired persons have 
stronger vision than Light-haired persons. According to this suggestion the strongest vision 
would be found in persons with dark hair and blue or grey eyes. 
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(ill) XTot* on the Sampling IMstribntlon of Jfix, wliwo tbo 
Population is Xf ormal. 


By P. WILLIAMS, B.A. 


In a previous issue of this Jowrnal Dr £. S. Pearson has published a table containing 
approximate 5**/, and I limits for the distributiona of »J^x j9a when sampling from a normal 
population*. The results were based on the quadrature of empirical curves (Type Vll and 
Type IV of K. Pearson’s system), the values of whose moment coefficients were obtained from 
expansions in series of inverse powers of the sample size, carried as far as terms in Later 
work by Professor R. A. Fisher has made it possible to obtain for exact values for the 
sampling moments up to /xgt. These may be written as follows: (1), (2) and (3) are given in 
Dr Pearson’s note it and (4) in a paper by J. Pepper§. 

6(n-2) 






/*•* 


■3 + 


(« + l)(»»+3) 

30(»-7)(n*+2f»-6) 

(»-a)(»H-0)(«+7K» + 9)‘ 


•( 1 ), 


.( 2 ), 


» / , 540{»7+60n*-131m‘-2798n«-362»»»+21362n* + 32843«-70t)70} _ 

(n-2)*(n+6)(»+7)(w + 8)(«+ll)(» + l'3)(n + 16) 

■®a 

M2 


105 (w4- 1)» (w-H3)3 {n6+ 171?i&+ 13893»*4-580401»3- 51 3101 471* + 14 132268»~ 12932920^ 
* (n-2)»(TC + 5)(n + 7) (n + 21) 


(4). 


Ill a short note Dr Pearson { has shown that the approximate expressions he had used for the 
moments differed only slightly from the true values; at n.«»50 : was correct as far as the 

fourth decimal place, while the differences in were as follows ; 

7i=s50 n»75 »=t=100 

True value ... 3 452 3'351 3*284 

Value used in making tables 3*46 3*35 3*28 

This disagreement is not large, but it seemed worth while to recalculate the corresponding 
Type VII curves, and also to make use of Professor Fisher’s I'esults to investigate the question of 
adequacy of approximation a little more fully. In the first place the Type VII curve was refitted 
for n«50 (the smallest value of n considei'ed by Dr Pearson), using the true moment ooeffioients; 
the 5 7. 1 “/o liniits were found to be identical, to three decimal places, with those previously 

given. 

The Type VII curves have been used as an approximation to the unknown true distribution ; 
their first four moment coefficients have the correct valuesjl, but their higher even moment 
ooeffioients will of course be in error. The comparison in Table I (p. 270) is of interest. 

At n«60, JBg and for the Type VII curve are so close U» the true values that the 
approximation may be expected to be good. At nan 25 the difference is especially laige for Bb> 
Since it is likely that a frequency curve with the correct first eighi moment ooeffioients would be 
nearer the true distribution than the Type VII curve, it was decided to make a frirther com- 


* Biometrika, Vol. xxn. p. 248. The tables are reprinted in TcMe$ for StatUticianB and Biametricians, 
Part n. Table XXXVII^, p. 224. These limits correspond to the points beyond which the chances are 
*05 and *01 of a sample value falling, and are useful in forming a rapid judgment on the significance of 
departure from normality. 

t Proc. Roy. Soe, A, Vol. 180 (1980), pp. 16—22. 

Biometrikut Vol. xxn. p. 428. 9 Biometrikax Vol. xxiv. p. 60. 

iJ Since the Type VII curve is symmetrical, all its odd moment ooeffioients are necessarily correct. 
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TABLE I. 


le size 

True values from 
equations (3) and (4) 

Values for Type VII curve with 
correct and B^ 


^4 




15 

25-4803 

297-0076 

26-4371 

359-2390 

0 

23-4203 

267-4391 

23-3940 

269-8547 


pariHon at w = 25, by iiwing one of the HaDsmann system of symmetrical curves* Those curves 
make use of the moments as far as 

Taking the numerical values for and given in Table 1, and also the values 

= -435 386, i?2 = 3-578 325 

oi)tiiined by putting w=25 in equations (1) and (2), it was found on applying Hansmann’s rules, 
that his Type (i) curve is the appropriate one. Its ecpiation takes the form 



where i- , and 

‘ (q -98*6732, p =9-21640, 

' 5-37496, ^0=0-42492. 


On a))plying (juadratui-e to this curve, the 5®/^ and 1 limits for v'^i are found to lie at 0-711 
and 1-061 respectively. Using the Type VII curve the limits lie at 0*710 and 1*050. The fonner 
l)air are presumably nearer the true values than the latter, but the diflfbi-ence is of no great 
im]>ortancc. In fact as n increases above 25 and approaches the value of 3t, we may 
anticipate that the Tyi)e VII approximation U) the unknown true sampling distribution will 
steadily improve. Dr Pearson’s table has therefore been extended in Table 11, using the Tyjio VII 
approximation, except in the case of w = 25 whei*e Hansmann’s' curve has been used. 

The interpretation of these limits is as follows. If, for example, a sample of 25 is drawn from 
a normal population, the chance is *05 that will lie above -711 ; it is also *05 that will 
lie below - *711. Similarly the chance is *01 that will lie above 1*061, and *01 that it will lie 
below - 1*061. It is clear that for samples of 100 or less the sampling variation in Jfii is very 
considerable so that it will be impossible to detect even large departures from normality in the 
population. The table however should be useful even if only to emphasise this fact. 


In calculating the i>ercentago limits of the Tyi)e VII curves it was possible to use the 
corresponding limits of the distribution as tabled by Professor Fisher J. This we may do by 
noting that for the general Type VII curve 


y=.yo 



( 6 ), 


where 


2fr»i52 

'A, -3’ 


5/?a-9 

"'‘" 2 ( 52 - 3 ) 


( 7 ). 


* G. H. Hansmann, “On certain Non-normal Symmetrical Frequency Distributions,” Biornetrika^ 
XXVI. pp. 129 — 96. 

f The following table shows the changing values of Bs as n increases: 

n 10 15 20 25 80 40 50 60 70 

52(V^i) 3*82 8-62 3*57 3-67 3*56 8-50 3*46 8*41 3*87 

X Statistical Methods for Research Workers^ Table IV, at end of work* 
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TABLE II. 


Size of sample 

Limits for V'/Si 

«7c 

10 7o 

25 

•711 

1-061 

.30 

•661 

•982 

35 

•621 

•921 

40 

•687 

•869 

46 

•658 

•825 

60 

•633 

•787 

60 

•492 

•723 

70 

•459 

•673 

80 

•432 

•631 

90 

•409 

•696 

100 

•389 

•567 


If in Professor Fisher^s notation n is the number of d^rees of freedom, then (6) becomes the 


t distribution on writing 

^-4(n-hl) (8), 

xjassitUn (9). 


Equations (1) and (2) with (7) give the appropriate values of a and m, the n with which to 
interpolate in the t tables is given by (8), and the resulting 6 and 1 values for t when 
inserted in (9) will give the required probability levels for = 

It is hoiked shortly to carry out a similar investigation regarding the sauqiling distribution 
of /3a. 


(iv) Note on Dr Ueher^e Paper on SSpicanthus 
(pp. 6 — 95 of this irolume). 

By KARL PEARSON. 

The interpretation of the valuable material collected by Dr Usher is not easy. In his 
statistics he has excluded Figs. 24 and 25. Further “continuous’’ and “discontinuous” 
inheritance may be interpreted differently by diflfei'ent readers. The terms indeed lose much of 
their meaning, if we accept the view that epicauthus may disappear with growth ; because by 
predicating a loss we can transfer a pedigree from the discontinuous to the continuous group. 
Further are we to reckon by “offspring” or by “parents”? Neglecting possible disappearance by 
growth, the Table on the following page represents the data presented by Dr Usher, and the 
reader is left to interpret the classification as he pleases. 
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Tabulated Da;ta of Dr Usher* a Epioanihus Paper. 


Number of OaBe« 


Inheritance 


3 Female ^ 

given 


Through Male Through Female Disoontinnous 
Offspring Fathers Offspring Mothers Offspring Parent 


68 

( + ? 8 ) 


Not inoluded by Dr usher. 
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BIOMETRIKA 


STATISTICAL PRINCIPLES OF ROUTINE WORK IN 
TESTING CLOVER SEED FOR DODDER. 


By J. PRZYBOROWSKI and H. WILEl^SKI. 


(Plant Breeding and Agricultural Experimentation Department, University 

of Cracow, Poland.) 

I. Introductory, 

The testing of red clover and other small seeds for dodder is one of the 
important everyday tasks of a Seed Testing Station. As the result of such tests 
can only be probable and never certain, it is clear that the routine work of these 
Stations must have a sound statistical basis. The only other writer on this subject 
that we know of is Schindler*, but we wish to attack the problem from another 
standpoint. 

The samples of clover seeds which are tested contain usually 100 — 200 gm., or 
over 50,000 single seed of clover. If in such a sample there are found several 
dodder seeds (the number varies from 0 to 36 per kg. according to the different 
countries) this would mean an unfavourable result of the test, and the seed would 
be considered unsatisfactory. Therefore in sampling clover we are in the position 
of making very numerous (over 50,000) and practically independent trials, and of 
counting those trials in which a definite event occurs. Each of these hypothetical 
trials consists in selecting at random a single seed to form the sample, the event 
being that the selected seed is a dodder seed. 


It is known that if the number of independent trials, t?, is very large and the 
probability, of an event, in these trials is constant and very small, then the 
number of occurrences of this event, k, follows the Poisson Law 


I 


k\ 


.( 1 ), 


where m « np, the mathematical expectation € (k) = m and the standard error = Vm. 

W. Bortkiewicz"|“ was the first to call attention to the practical use of the 
Poisson Law. Since then it has been applied to numerous problems J. 


* J. Sohindler: ‘ * Untersuohnngen ilber Eleeseide-Yerteilung in sohwaoh seidehaltigen Kleesamen.*’ 
Landwirtschaftliche VerstLcha-Stationen^ Bd. cviii, Leipzig 1929. 
t W. Bortkiewioz: Dan Oenetz der kleinen Zahlen, Leipzig 1898. 

t “Student”: “On the Error of Counting with a Haemacytometer.” Biametrikaj Vol. v, 1907. 
K. Iwaszkiewicz and J. Neyman: “Counting Virulent Bacteria and Particles of Virus.” Acta Biologiae 
Experimentality Vol. vi. Warsaw 1981. [The caution with which the Poisson Series must be applied is 
emphasised by L. Whitaker: Biometrika^ Vol. x. pp. 86 — 71. In all oases of applying Poisson's Series 
the corresponding binomial ought to be worked out. This in the case of much of the data treated by 
Bortkiewioz, Mortara and others turns out to be a negative binomial, and the application of a Poisson 

Blometrika xxvn 19 



274 Testing Clover Seed for Dodder 

The purpose of this paper is twofold: (i) to show that we are justified in 
assuming that the number of dodder seeds in samples of clover does follow the 
Poisson Law and (ii) to construct rules, based on this assumption, which should be 
followed in routine work in order to minimise the chance of errors in tests for 
dodder. Some of these rules may have wider application in other sampling 
problems where the Poisson distribution holds good. 

II. Tenting Goodness of Fit of the Poisson Series, 

When testing the assumption that the distribution of dodder in samples of 
clover seeds follows the Poisson Law we have to keep in mind the circumstances 
which may cause variations. In all practical cases we are justified in assuming that 
the probability of a randomly chosen seed being a dodder seed is very small. 
The only assumption which may be untrue is that of the independency of our 
“ trials.*' In fact a sample of about 50,000 grairjs to be tested for dodder is never 
taken by selecting at random single seeds. On the contrary the samples which 
are analysed are usually made up of seeds which were in close proximity to 
each other in the sack. Therefore if the distribution of dodder in the whole mass 
is not random it will not follow the Poisson Law in the samples. So the problem 
before us consists really in testing whether the clover seed in ordinary trading 
conditions is properly mixed or homogeneous with regard to dodder. This has 
been tested experimentally for us by Broniewski. By means of his experiment we 
contrived to obtain a sack of clover seed containing a known number of dodder 
seeds, where the material was mixed to a degree similar to that obtained in seed 
trading. 100 kg. of red clover seeds were mixed with 2000 dodder seeds; the 
latter were dyed in order to make them easily distinguishable. In practice the 
homogeneity of seeds is increased by different processes of threshing, cleaning, etc., 
then the seed is transported in sacks, often on ba<l roads, and here the homogeneity 
may be diminished through continual shaking, the smaller dodder seeds slipping to 
the bottom. We have tried to allow for this in our experiments. After they were 
mixed and put into a sack, the seeds were carried in a four-wheeled waggon on a 
bad road over a distance of about 6 km. Then beginning from the top of the sack, 
500 clover samples of about 100 gm. each were taken and examined for dodder. 
The result was that 986 dodder seeds were found. The small deviation from 1000, 
which was the expected number of dodder seeds in the 60 .kg. analysed, may be 
regarded as the result of random sampling. The possibility should also be taken 
into consideration that in the course of the experiment a few dodder seeds may 
have remained unnoticed and therefore not included in the 986. 

In Table I A the empirical results are compared with the theoretical numbers, 
given by the Poisson Series, assuming that the mathematical expectation m » 2, 
i.e. the a priori known mean content of dodder per sample. 

Series is of doubtful validity. The origin of negative binomials has been disoussed by K. Pearson: 
Biometrika, Vol. xi. pp. 139 — 144 and especially by “Student”: IHd. Vol. xii. pp. 211 — 215. These 
papers should certainly be studied by those who simply apply the P, x* teat, without consideration of the 
binomial of which the Poisson Series is supposed to be the limit. En.] 
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TABLE I. 

Distribution of Dodder in 600 Samples of Clover Seeds. {Our Eicperiment.) 
A. B. 



m- 

= 2 

m= 1-972 

h 


iV.P* 

k 


N. Pfc 

0 

56 

67-67 

0 

56 

69-59 

1 

15G 

135*34 

1 

156 

137*23 

2 

132 

135-34 

2 

132 

136*31 

3 

92 

90-22 

3 

92 

88*94 

4 

37 

45-11 

4 

37 

43-85 

5 

22 

18-04 

5 

22 

17-29 

6 

4 

6-021 

6 

4 

5-681 

7 

8 

0 

1 


7 

8 

? 

l-60l 

0-39 M 

over 8 

0 

0 - 12 J 

over 8 

0 

0 - 12 ; 

n' = 

fy 7^e= 7^' — 

1, v*=8-9ie9 

n'^7 

71— n' - 1 

• v“=8-7699 


7\a«0*179 


Px« = 0-189 


/;3«nuTnl^r of dodder seeds in a sample. 
iVjfc= observed frequency. 

N . 74— expected frequency. 
n' — number of grou}>8. 

71 =5s degrees of fi’eedom. 


In Table I B we have accepted m as unknown, i.e. we have considered those 
samples as drawn from material, the dodder content of which is entirely unknown 
to us; thus we estimated m from 

W = ^ XkNit (2), 

where k is the number of dodder seeds in a 100 gm. sample; the number of 
samples in which we have found k seeds; N the general number of the investigated 
samples. Here the approximate value of m = 986/500 == 1’972, 

The diagram corresponds to the numbers given in Table I B. 

It will be seen from both tables that the Poisson Law fits extremely well to 
our data. Thus we may conclude that if the ordinary trading conditions are 
similar to the conditions of our experiment, which we believe to be true, then the 
assumption that the frequency of dodder in samples of seeds follows the Poisson 
Law is genemlly justified. 

* According to the view of K. A. Fisher, E. S. Pearson and J. Neyman the degrees of freedom in this 
case should be n'»2, and accordingly — 0*120, i.e. le»$ than for ms:2, which is 0*179. 


18—2 
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TABLE II 

Distribution of Dodder in Samples of Clover Seed, (Schindler's Experiments,) 

12 3 4 


k 


N.I\ 

k 

Nu 

N.Pfc 

k 

H 

N.Pj, 

k 


N.P* 

0 

168 

183-94 

0 

599 

606-63 

0 

382 

389*40 

0 

284 

303*27 

1 

205 

183-94 

1 

315 


1 

111 

97-36 

1 

reii 

151*68 

2 

94 

91-97 

2 

74 

75-82 

2 

7 


2 

39 

37*91 

3 

26 

30-66 

3 

12 

12-641 , 

over 2 

0 

1*08/ 

3 

7 

7*19 

4 

6 


over 3 

0 

1-74/ 




over 3 

0 

0-87/ ^ 

5 

1 

1-6319-49 










over 6 

0 

0-30 J 










1— 


1 

m 


0-5 

m 



m 


0*6 



6 

n' 


4 

nl 


3 

v! 


4 

M 


5-1990 



0-9848 



5*0027 

X* 


3*4863 

'BH 


0*270 

P^i 

1 

0-804 

A* 


0*082 



0-328 


k 


JV.Pj 

■■ 


N.Pu 

k 

Nit 

N.P, 

B 

Nk 

N.Pj, 

0 

795 

774*04 

0 

447 

452-42 




0 

295 


1 

94 

116-20 

1 

51 

45-24 

1 

|m 


1 

153 

151-63 

2 

11 


2 

2 

o-os; ^ 

over 1 

1 

0*61 

2 

44 

37-91 

over 2 

0 

over 2 

0 




3 

8 

7-10 
0-87/ ' 










over 3 

0 

rn 


0*15 

m 


0*1 

in 


0-05 



0*5 

n' 


3 

n' 


3 

n! 


3 



4 



5-1280 



0-8477 



0-4709 



1*3075 

A* 


0-078 



0*666 



0-816 



0-731 


10 


11 


ft 


N.V^ 

k 

Nk 

N.Pjc 

k 

Nk 

N.Pk 

0 

22 

16*42 

0 

0 

1*08 

0 

0 

0*09 

1 

29 

41-04 

1 

3 

5-39 

1 

0 

0-66 

2 

56 

61-30 

2 

13 

13-48 

2 

1 

2-49 

3 

43 

42-75 

3 

15 

22-46 

3 

4 

6*22 

4 

34 

26-72 

4 

33 

28-07 

4 

9 

11*67 

6 

10 

13*36 

6 

28 


6 

16 

17-50 

6 

3 

6-57-1 

6 

24 

23-40 

6 

19 

21-87 

7 

4 

1*99 }• 8-41 

7 

21 

16*71 

7 

19 

23-44 

over 7 

0 

0*85 J 

8 

10 


8 

26 

21*97 




9 

8 

5-80] 


9 

19 

18-31 




10 

0 

2*90 


10 

16 

13*73 




11 

4 

1-32 

11 

14 

9-36 




over 11 

1 



12 

5 

5-86^ 







13 

6 

3-38 







14 

3 

1*81 \ 12-68 







15 

3 

0*90 







over 16 

1 

0*74; 

m 


2-6 

m 


5 

m 


7*6 



7 

n' 


10 

n' 


13 

X* 


8*7617 



7-0406 

a" 


9*8060 



0*189 



0*633 



0*633 


j^Mnnumber of dodder seeds in a sample. 7i'«Mnumber of groups. 

Njf m observed frequency. n' - 1 ■» degrees of freedom. 

Pik^expeoted frequency. 
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Table II givets the results of fitting the Poisson Series to data published by 
Schindler. The results are often much better than in our case, but this is probably 
due to the fact that in Schindler’s experiments special efforts were made to increase 
the homogeneity of the material. These efforts increase the agreement between 
the observed and expected frequencies but diminish the practical importance of 
the results. 

Although we think that the homogeneity of material in our experiment was 
similar to that in ordinary trading conditions, it is useful to study the dodder 
distributions in samples drawn by the usual methods from stocks prepared for 
sale according to the ordinary trading treatment. One set of datii of this kind is 
analysed below. (See p. 281.) 

Lastly it should bo noted that the independence of samples in Schindler’s 
experiments as well as in ours, was by no means complete. However, the correlation 
could only be very slight and not have any significant influence on the numerical 
results. Therefore it will be assumed in the following sections that the number of 
dodder in samples of clover seed follows exactly the Poisson Law. 

HI. Accuracy of the Analysis for Dodder, 

When given seed is analysed for dodder, one of the two following questions 
may be asked: 

(i) Does the dodder content in the seed exceed a certain fixed standard ? 

(ii) What is the actual dodder content in the seed? 

The solution of the first question involves what is called the test of a statistical 
hypothesis. The other question is an instance of another statistical problem, that 
of estimation. 

(a) Testing the Hypothesis concerning Dodder Content. After examining a sample 
of seeds, we want to estimate the average number of dodder seeds in a unit of 
weight, e.g. in 1 kg. of the stock under consideration. Practically we are only 
concerned with the question whether that average content does or does not exceed 
the legal, customary or agreed standard. According to the trading cu8t<.>ras accepted 
in various countries this usual standard varies from 0 to 36 dodder seeds per kg. 
The accepted standard, let us say, thq dodder seeds in 1 kg. of stock, is a limit 
between two classes of clover seed on the market. This limit will be called the 
tolerance limit of the true mean dodder content. Stocks containing two or less 
dodder seeds in a unit of weight belong to one class, while those containing more 
than thq belong to the other class. In each case we are faced with the problem of 
deciding to which class the material under consideration belongs. 

In the theory of testing statistical hypotheses, initiated by J. Neyman and 
E. S. Pearson*, we distinguish two kinds: 

(1) a simple hypothesis which completely specifies the probability law, 

(2) a composite hypothesis which specifies the probability law only partially. 

* J. Neyman and E. S. Pearson. ‘‘On the Problem of the most efficient Tests of Statistical 
HyiK>theses.” Phil. Tran^. Hoy. Soc. 1988, Vol. 281 A. pp. 289 — 837. 
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It will be seen that when a composite hypothesis is given there are many 
different ways in which we may complete the specification of the probability law, 
and thus transform the given composite hypothesis into different simple ones. 
These hypotheses form a class of simple hypotheses “belonging” to the given 
composite one. 

In our case the statistical hypothesis we test will concern the true mean 
number of dodder seeds in samples of clover seed. This number will be denoted 
hy m] X will denote the number of dodder seeds in a sample. According to our 
previous convention it will be assumed that 

= (3) 

represents the frequency law of x, given m. Any hypothesis specifying the value 
of m, say m^nii, will be a simple hypothesis, the probability law p{x) being 
completely specified. On the other hand if a hypothesis states only that m > m,: 
or m^nii or mi<7n^rnj, then the hypothesis does not specify the probability law 
completely and is a composite one. Simple hypotheses belonging to the composite 
which states that m < 5 will be all those which ascribe to m any value less than 5, 
e.g. m = 1, 1*6, ..., etc. 

Denote by nio the conventional tolerance limit of the dodder content per 100 gm. 
of seeds. The practical question whether any given seeds contain more or less 
dodder than the tolerance limit allows may be reduced to the mathematical 
problem of testing one or other of the following statistical hypotheses. Either we 
may test the hypothesis m^nio or the hypothesis m> mo. 

The first of these hypotheses will be called the favourable hypothesis and 
denoted by F. The other hypothesis that m > mo will be called unfavourable and 
denoted by U, Obviously both F and IT are composite hypotheses. 

When testing statistical hypotheses one must bear in mind two kinds of 
possible errors; those of the first kind which consist in rejecting the hypothesis 
when it is true and those of the second kind which consist in accepting the 
hypothesis when it is false. Neyman and Pearson have developed a theory which 
allows us to choose from all possible criteria one which is able to control, in the most 
efficient way, both kinds of errors*. However, our present problem is so simple that 
there is no need to apply their theory, as the observations only supply us with one 
number, a?, which is the number of dodder seeds in the sample, and which therefore 
is the only criterion to use. 

If we were testing the hypothesis F, stating that m^mo, we should reject the 
hypothesis if x, the number of dodder seeds in the sample, is “too large.” On the 
other hand if x is not “too large” we should accept F. If it were the hypothesis 
U which we were testing we should proceed the other way round and reject U 
whenever x is “sufficiently small,” say a?<a. Here we shall only use the Neyman- 

* See J. Przyborowski et H. Wilenski : “ Bur les erreurs de la premiere et de la seoonde categoric dans 
la V^ridoation des Hypoth^es oonoernant la loi de Poisson.*’ Comptes rendus des stances de V Academic 
dec ScUnocMf Vol. oo. p. 1460. Paris, 1985. 
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Pearson theory in order to give an exact meaning to the words in inverted commas, 
namely, *'too large” and “sufficiently small.” Wc shall have to take into considera- 
tion the probability of both kinds of errors which would correspond to any number 
which could be chosen to serve as a conventional limit between the values of x 
which are “too large” and the values which are not “too large.” 

In the following we shall discuss the problem, assuming that we are testing 
the hypothesis ^7'. As it was stated this hypothesis will be rejected, and thus the 
seed will be considered satisfactory, whenever x is “sufficiently small.” Denote by 
a any number which could be used as a conventional limit between the values of 
X which are “sufficiently small” and those which are not. This will be called the 
tolerance limit of dodder content in the sample. If a sample contains a dodder 
seeds or less, the hypothesis U will be rejected. 

Let us estimate the probability, say Pi (mo, a), of first kind errors corresponding 
to the fixed values of a and mu. Denote by p (?a) the probability law a priori 
concerning the mean number, m, of dodder seeds per sample. Obviously 

TOO 

Pi (jiifi , a) ** I P {{x a) I m] p (m) dm (4), 

Jtn„ 

where P {{x^a)\m] is the probability corresponding to the fixed value of m of 
having a dodder seeds, at the most, in a sample. In the above formula p (m) is 
unknown. Therefore the actual value of Pi (mo, a) is unknown. However, we can 
find a value which Pi (mo, a) never exceeds. 

Obviously for any value m > m© we shall have 

P [{x < a) I m} < P {{x ^ a) | m©} (5). 

Therefore 

Pi (wio, a)^P {(a; < a)l m©} I p (m) dm ^P {(x^ a) | mu] (6) 

J mo 

as the integral 

I p{m)dm^l (7), 

J mo 

whatever the positive number m© and whatever the unknown function p(m). 
It is thus seen that though we are not able to calculate the actual value of the 
probability of first kind errors, we are able to determine the upper bound of this 
probability. This is given by 

P \{x < a) I r/i©) a= S 6”^ “■“if a^e~^dx (8). 

If in fixing the rules of testing the hypothesis U we choose the values of m© and a 
so that P {{x < a)|m©} =» 6, where e is any small number, say ‘05 or ‘01, chosen in 
advance, then we may be certain that in the long run the hypothesis tested will 
be wrongly rejected with a relative frequency not exceeding e. 

Now let us turn to the errors of the second kind which consist in stating that 
the seed tested is not satisfactory, whereas in fact the average amount of dodder 
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it contains does not exceed say mf < ma> Denote by a) the probability of 

such an error. Using all previous notation we shall have 

rm* 

Pjx {rn\ a) ■« I P {(jc> a)\m] p (m) dm (9), 

Jo 

where P {(as > a)\m] is the probability, corresponding to the fixed value of m, of 
having more than a dodder seeds in a sample. It is easily seen that owing to 
the function p (m) being unknown, we are not able to obtain the actual value of 
Pii (m\ a). However, as for m < w', 

P {(^r > a) I m} < P {(ic > a) | m'\ 

we may write again 

Pii (m', a) P {(^ > a) | m'] I p (m) dm ^ P {(.t > a) | m/\ 

Jo 

rm' 

as the integral I p (m) dm cannot exceed unity. 

Jo 

The formulae (6) and (11) give us the limits which the probabilities of first 
and second kind errors cannot exceed. However, it will be noticed that in unfavour- 
able cases these limits may be approached by the corresponding probabilities as 
closely as desired, and as in arranging the rules for routine work we have to think 
of the most unfavourable conditions, we may deal with the upper limits as if they 
were the values of the probabilities themselves. 

(6) Numerical Illustrations. The values of P {(as ^ d) | nio] and P {(as > a) | m'} 
are given in Tables III and IV respectively The use of these tables may be 
illustrated by the following examples: 

Example I. Suppose the tolerance limit m© of dodder per sample is fixed at 
tuq = 6. In other words we assume that if the true average content of dodder 
per sample in the seed tested exceeds m© ■■ 6, then it is agreed to consider such a 
seed unsatisfactory. Suppose further that the rules of testing are set up as follows: 
the Buyer is willing to accept the seed only when the random sample contains 
dodder seeds not exceeding, say, a = 5. In other cases the Buyer will refuse the 
seed. Thus the tolerance limit for the true mean dodder content is m = 6 and the 
tolerance limit of the dodder content in the sample is a = 6. We now turn to 
Table III in order to judge whether the interests of the Buyer are properly 
protected- We find that the value of P{(a?^a)| mo] corresponding to these limits 
is '446. This means that if the above rules are followed, then unsatisfactory seed 
will be accepted by the Buyer, with a relative frequency which, in unfavourable 
cases, may be as large as 44 per cent. It is obvious therefore that if the tolerance 
limit of the true dodder content is really m© = 6, the tolerance limit for the dodder 
content in the sample, fixed at a «= 5, does not present any real protection to the 
interests of the Buyer. 

^ These were oaloulated using the Tables of the Incomplete T-Funetiont edited by Karl Pearson 
and published by the Office of Biometrika, University College, liondon. Beissue 1984. 


( 10 ). 

( 11 ). 
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TABLE III. 

Upper Limits of the Probability of First Kind Errors : ‘P {(a; < a) \ mo}. 


X 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

/ «io 

01 

*JK)r> 








_ 



0-1 

0*5 

•007 

— 

— 


— 


— 

— 

— 

— 

— 

0*6 

1 

•368 

•736 

— 

— 



— 

— . 

— 

— 

— 

1 

2 

•135 

•400 

•677 

— 

— 

— 

— 


— 

— 

— 

2 

3 

•050 

•199 

•423 

•647 


— 

— 

— 

— 

— 

— 

3 

4 

•018 

•092 

•238 

•433 

•629 

— 

— 

— 

— • 

— 

— 

4 

6 

•007 

•040 

•1*25 

•265 

•440 

•616 

— 

— 

— 

— 

— 

6 

6 

•002 

•017 

•062 

•151 

•285 

•446 

•606 

— 

— 


— 

6 

7 

•001 

•007 

•030 

•082 

•173 

•301 

•450 

•599 

— 

— 

— 

7 

8 

— 

•(K>3 

•0J4 

•042 

•100 

•191 

•313 

•453 

•593 

..... 

— 

8 

9 


•001 

•006 

•021 

•055 

•116 

•207 

•324 

•456 

•587 

— 

9 

10 

— 


•003 

•010 

•029 

•067 

•130 

•220 

•333 

•468 

•583 

10 

11 



•OOl 

•005 

•015 

•038 

' -079 

•143 

•232 

•341 

•460 

11 

12 

— 

-- 

•001 

•002 

•008 

•020 

•046 

•090 

•155 

•242 

•347 

12 

13 

— 

— 


•001 

•004 

•on 

•026 

•054 

•100 

•166 

•252 

13 

14 

— 

— 

— 

— 

•002 

•006 

•014 

•032 

•062 

•109 

•176 

14 

15 

— 

_ . 

— 

— 

•(K)l 

•(X)3 

•008 

•018 

•037 

•070 

•118 

15 

10 

— 

— 


— 

— 

•001 

•004 

•010 

•022 

•043 

•080 

16 

17 

— 

_ . 

— 

— 

— 

•001 

•002 

•005 

•013 

•026 

•049 

17 

18 

— 

— 

— 

-- 

— 

— 

•001 

•003 

•007 

•015 

•030 

18 

19 

- - 

~ 


— 

... 

— 

•OOl 

•002 

•004 

•009 

•018 

19 

20 

— 

— 


— 

— 

— 

— 

•001 

•002 

•005 

•Oil 

20 

21 

— 

— 

— 

— 

— 

— 


— 

•001 

•003 

•006 

21 

22 

— 

— 

— 

— 

— 

— 

— 


•001 

•002 

•004 

22 

23 

— 


— 

— 


— 

— 

— 

— 

•001 

•(K)2 

23 

24 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

•001 

24 

25 

— 

— 

— 

— 

— 

— 

— 

— 

— 

— 

•001 

25 


a =:= tolerance limit for the dodder content in the sample. 

„ „ true mean dodder content. 


Looking at Table III it will be seen that keeping the tolerance limit of the 
true mean dodder con bent at ttiq « 6, we may procure a reasonable level of safety 
for the Buyer if we fix the tolerance limit of dodder content in the sample at a = 2 
or even a = 1. This means that the seed will be accepted as satisfactory whenever 
the dodder content in the sample does not exceed 2 or 1 seeds. Even with such 
precautions the Buyer may get unsatisfactory goods. This, however, will happen 
but rarely and with a relative frequency not exceeding *06 in the one case and *02 
in the other. 

Example II. We have considered the problem of tolerance limits from the 
point of view of the Buyer, now let us consider it from the Seller’s. 

Assume that the tolerance limit for dodder content in the sample has been 
fixed at a as 2 to suit the interests of the Buyer who is not willing to accept seed 
with a true mean content of dodder exceeding mo »= 6 seeds per size of sample. 
The Seller will naturally be interested in the chance that the seed, which actually 
contains less than 6 dodder seeds per sample, should successfully pass the test. 
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The probabilities of second kind errors, i.e. that the seed will be unjustly rejected, 
are given in Table IV, where a is again the tolerance limit for the sample and m' 
the €tssumed true mean content of dodder in the seed tested. 


TABLE IV. 


Upper Limits of the Prohahilitp of Second Kind Errors : P {(a? > a) j m'}. 


N . m ' 

•05 

•1 

•2 

•25 

•5 

1 

2 

2-5 

6 

10 

20 

25 


0 

•049 

•095 

•181 

•221 

•393 

•632 

•866 

•918 

•993 




0 

1 

•001 

•005 

•018 

•027 

•090 

•264 

•594 

•713 

•960 

•999 

— 

— 

1 

2 

— 

— 

•001 

•002 

•014 

•080 

•323 

•466 

•875 

•997 

— 

— 

2 

3 

— 

— 

— 

— 

•002 

• 01 » 

•143 

•242 

•735 

•990 

— 

— 

3 

4 

- 



— 

— 

•004 

•053 

•109 

•560 

•971 


— 

4 

6 

— 

— 

* 

— 

— 

•001 

•017 

•042 

•384 

•933 

— 

— 

6 

6 


— 

— 

— 

— 

— 

•006 

•014 

•238 

•870 

— 

— 

6 

7 

— 

— 

— 

— 

— 

— 

•001 

•< X)4 

•133 

•780 

•999 

— 

7 

6 

— 

— 

— 

— 

— 

— 

— 

•001 

•068 

•667 

•998 

— 

8 

9 

— 


— 

— 

— 

— 

— • 

— 

•032 

•542 

•995 

— 

9 

10 

— 

— 

— 

— 


— 

— 

— 

•014 

•417 

*989 

— 

10 

11 

— 

— 

— 

— 

— 

— 

— 

— 

•005 

•303 

•979 

•999 

11 

12 

— 

— 


— 

. — 

— 

— 

— 

•002 

•208 

•961 

•997 

12 

13 . 

— 

— 

— 

— 

— 

— 

— 

— 

•001 

•136 

•934 

•994 

13 

14 

— 

-- 

— 

— 

— 

— 

— 

_ - 



•084 

•895 

•988 

14 

15 

— 

— 

— 

— 

-- 

— 

— 

— 



•049 

•843 

•978 

15 

16 


— - 

-■ 

— 

— 

— 

- 

— 

— 

•027 

•779 

•962 

16 

17 

— 

— 

— 

— 

— 

— 

— 


— 

•014 

•703 

•940 

17 

18 


— 

— 

— 

— 

— 



— 

— 

•007 

•619 

•908 

18 

19 

— 

— 

— 

— 

. — 

— 

— 

— 

— 

•003 

•530 

•866 

19 

20 

— 

— 

— 


— 

— 

— 



— 

•002 

•441 

•815 

20 

21 

— 

— 

— 

— 

— 

— 

- — 

— 

— 

•001 

•356 

•753 

21 

22 

— 

— 

— 

— 

— 

— 

-- 

— 

— 

— 

•279 

•682 

22 

23 

— 

— 

— 


— 

— 

— 

— 


— 

•220 

*606 

23 

24 

— 

- 

— 

— 


— 

— 

— 

— 


•157 

•527 

24 

25 

— 

— 


— 

— 

— 

— 

— 

— 

— 

•112 

•447 

25 

26 

— 

— 

— 

— 

— 

— 

— 



— 

— 

•078 

•371 

26 

27 



— 

— 

— 


— 


— 



•053 

•300 

27 

28 

— 

— 


— 


— 





— 


•034 

•237 

28 

29 

— 


— 


— 


— 






•022 

•182 

29 

30 


— 

— 

— 

- 

— 

— 

— 

_ 

— 

•014 

•137 

30 

31 

— 

— 


— 

— 

— 

— 

— 

. — 

— 

•008 

•100 

31 

32 


-- 

— 


— 


— 





•005 

•072 

32 

33 

— 

— 


__ 


— 

— 





•( K)3 

•050 

33 

34 

— 

— 



— 


— 



— 

•001 

•034 

34 

35 


— 

— 



— 

— 

— 

— 


•( X)l 

•023 

35 

36 

— 

— 

— 

— 

— 

— 

— 

— 

— 


— 

•015 

36 

37 

— 


— 

— 

— 

— 

— 

— 


— 

— 

•009 

37 

38 


— 

— 

— 

— 

— 


— 


— 

.... 

• oof ; 

38 

39 

— 

— 

— 

— 


— 

— 

— 

— 

— 

— 

•003 

39 

40 

— 

— 

— 


— 

— 


— 

— 

— 

— 

•002 

40 

41 

— 

— 

— 

— 

— 

— 

— 

— 

— 


— 

•001 

41 

42 

— 


— 

— 

— 

— 

— 

■ — 

— 

— 

— 

•001 

42 


aa»tolerauco limit for dodder content in a Hample. 

asHumed true mean dodder content i>er size of sample. 
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TABLE IV (continued). 



It will be seen that if the seed tested contains, on the average, as many as 
m' =■ 6 grains per sample, then the probability of the seed not being passed by the 
test is equal to ‘875. In other words, if the interests of the Buyer are met by 
choosing the sample tolerance limit a = 2, then in 87'5 per cent, of cases the seed 
offered will be found unsatisfactory, showing more than two dodder seeds in a 
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sample, when in fact it contains » 5 dodder seeds per sample, thus satisfying 
the requirements of the Buyer. A reasonable chance of passing the test successfully 
corresponds to » *5. Here the probabilities of an unjust rejection of the seed 
is *014. 

Thus in order to secure a reasonable chance of his goods passing the test, the 
Seller must aim at a dodder content about 12 times smaller than that which is 
fixed as the tolerance limit mo»6. 

The position is most unsatisfactory, as whenever the test is efficiently protecting 
the interest of one of the contracting parties, the interests of the other are seriously 
damaged. 

Now let us assume that the samples tested are, say three times, larger so that 
the tolerance limit of the true mean dodder content is correspondingly increased 
to fWo 18. It will be seen that a reasonable protection of the Buyer's interests 
will be attained if the tolerance limit of the dodder content in the sample were 
fixed at, say, a = 10. Doing so we may be certain that the corresponding probability 
of first kind of error will not exceed -03. (See Table III.) On the other hand, looking 
at Table IV, we see that the seed, whose true mean dodder content m' == 5, will 
have a reasonable chance of passing the test, the upper limit of the probability of 
second kind error being -014. Now the ratio mo/m' = 3*6, and thus the gap between 
the requirement of the Buyer and the safety level of the Seller is shortened very 
considerably. It will be seen that the use of larger samples is most beneficial both 
to the Seller and the Buyer. Unfortunately this means a larger expenditure on 
analysis. However, we shall make further mention of this point below. 

Example III. We shall here call special attention to the case when the toler- 
ance limit of dodder content in the sample is fixed at the lowest possible value 
a = 0. It is interesting to see exactly what sort of protection this provides to the 
interests of the Buyer, Looking at Table III we find that if m© == 3, then, on the 
average, one sample out of 20 would be totally free from dodder. The test applied 
will thus give a favourable result and the seed will be accepted. We see therefore 
that statements like “dodder free** if they are based upon analysis of only one 
sample of usual size may be frequently misleading. The seed which is described 
as “dodder free** is quite likely to contain as many as two dodder seeds per size of 
the sample tested, it is also quite possible that it contains three; in fact we should 
only be surprised when the true mean content is as large or larger than four. 

If the size of the sample tested is large, then a dodder content of 2 or 3 grains 
per sample could be perhaps considered as negligible. But in the ordinary size of 
sample such amounts would be disappointing, especially in a seed which has been 
qualified as dodder free. 

(c) Upper Confidence Limits of Dodder Content, The last example leads us to the 
new form of problem of estimation which has been started by R. A. Fisher* and 

* R. A. FUher: Proc, Cam, Phil, 8oc. Vol. xxvi. part 4. 
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then (Jcvelopod by J. Neyman*, namely the problem of confidence limits for a given 
unknown parameter. In our case the role of this parameter is played by the true 
mean dodder content per size of sample in a given consignment of seed. 

Wo think that the best way of expressing the results of an analysis for dodder 
content consists in stating the limit which presumably is not exceeded by the true 
mean dodder content, e.g. the seed does not contain more than, say, 4 grains of 
dodder per kg. The limit stated, viz. 4, is what is called the upper confidence 
limit. J. Neyman has shown how statements of this kind can be made so that the 
probability of their being correct will not be smaller than a limit fixed in advance. 
This limit of probability has been called “the confidence coefficient/* If it is said 
that a seed testing station works at a confidence coefficient, say a = *99, it means 
that its verdicts concerning the quality of the seed in the form stated above may 
be wrong only with a relative frequency not exceeding 1 — a or one in a hundred. 

The upper cionfidcnce limits, corresponding to the chosen confidence coefficient 
a, are found as follows: <lcnote by a any integer number of dodder seeds which may 
be found in a sam|)le, and find such a value of m^ma that 

P {(a; > (i) I nia] = a (12). 

The numbers thus defined are the upper confidence limits. Suppose, in fact, 
that the above equation is solved for any a « 0, 1 , 2, . . . , and suppose we make a 
rule of stating that the true mean dodder content, m, in seed docs not exceed ma 
whenever the analysis of a sample gives ic = a. It is easy to see that the probability 
of an error in such a statement never exceeds the limit 1 — a. 

This may be proved as followsf : wc notice first that an error in estimating the 
dodder content by stating the upper confidence limit may occur only if the actual 
dodd(‘r content m and the dodder content in the sample satisfy certain inequalities. 


Namely, W(^ shall be wrong whenever, say, 

nia < rtta^i (13) 

and (14) 

happen at the same time. 


Let us estimate the })robability of the simultaneous occurrence of the above 


two inequalities. For this purpose we notice that 

P ((ir < a) I m) = 1 — P {(a? > a) ] m} (15). 

If 7Ha < 'ffh then obviously 

P [(a; > a) I m} > P {(oj > a) I ma\ ~ « (15). 

Thus wt‘ shall have 

P ((a;^ a)|?r4j < 1 — a (17). 


This being true for any m satisfying (13) and for any a, we conclude that whatever 
the' unknown frequency distribution of rn, the probability of having (13) and (14) 

* .T. Keyman : Journal R. Statistical Soc. Vol. xevn. pp. 589 — 698. 

+ See T. MatUH/.ewskL, J. Neyman and J. Bupi^aka: Supplement to the JoumaX 12. Statistical 
Soc. Vol, 11 . part 1. 
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simultaneously does not exceed 1 a. This is another way of saying that the 
probability of errors in statements m^ma made when we get, in the sample^ 
exactly a? *= a dodder seeds, does not exceed 1 — a. 

The upper confidence limits for the mean dodder content corresponding to 
different confidence coefficients, a, are given in Table V*. The use of this table 
may be illustrated in the following example; 

Example IV. Suppose in three different samples of 100 gm. of clover seeds 
the amount of dodder found was oJi « 0, — 3 and a?8 = 10 respectively. The 
statement concerning the true mean dodder content in the seeds tested must 
depend upon the caution of the testing station. If it be considered that errors in 
statements occumng, on the average, once in ten times may be admitted, then the 
confidence coefficient chosen must be *9. In this case the statements concerning 
the true dodder content per 100 gm. of seeds in the consignments tested, should be 

mi < 2*3, mg < 6'7, ni^ < 1 5’4. 

(In order to obtain the estimates per 1 kg. we have only to multiply the above 
figures by 10.) 

However, it may be considered that this frequency of errors, i.e. 1 in 10, is too 
considerable. Then the confidence coefficient chosen should be larger. It may be 
desired to commit errors as seldom as one in a hundred, then a == ‘99 and our 
statements about the dodder content in the same consignments would be 

wi < 4‘6, mz < 10*0, mz < 20*2. 

We notice that the above example suggests again that the reliability of tests for 
dodder requires the use of larger samples. In fact, even when no dodder seeds are 
found in a sample, if we do not want to risk an error more than once in a hundred 
statements, we should only state that the true dodder content in the seed does not 
exceed 4*6 grains per size of sample. If this is large then the accuracy may be 
sufficient; if, however, the sample is small then it may be considered unsatisfactory. 

The question of the size of the sample may 'be treated more accurately in the 
following way: 

Fix any confidence coefficient, say a ■■ *99, and suppose that the samples in the 
above examples contained 100 gm. of seeds. The result of the test of the first 
sample may be stated by saying that in the seed analysed there was probably no 
more than 46 grains of dodder per 1 kg. 

Now assume that the sample contained 6 kg. of seed, and that again no dodder 
was found. Our estimate per size of sample will be the same, i.e. not more than 
4*6 grains on the average. But if we turn from the size of sample to the standard 
unit of 1 kg., the statement will change. We shall state, in fact, that in the 
consignment there is presumably no more than *9 grain of dodder per 1 kg. In 
this way it is easy to calculate that if for the sample which proves dodder free it 
should be sufficient to state that the amount of dodder per 1 kg. of seed does not 
exceed 1 grain, then the size of this sample should be as large as 4*6 kg. 

* This was again oaloulated using the Tabl^ of the Incomplete V-Function, 
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TABLE V. 

Upper Confidence Limits, ntatfor the True Mean Dodder Content. 



0-9 

0*96 

0-98 

0-99 

0-995 

0-999 


0 

2-3 

3*0 

3-9 

4-6 

5*3 

6-9 

0 

1 

3-9 

4-7 

5*8 

6*6 

7-4 

9*2 

1 

2 

5*3 

6-3 

7*6 

8-4 

9-3 

11-2 

2 

3 

6*7 

7’8 

9-1 

10*0 

11*0 

13*1 

3 

4 

8-0 

9*2 

10*6 

11*6 

12-6 

14*8 

4 

5 

9-3 

10*5 

12-0 

13*1 

14-2 

16-6 

5 

6 

10-5 

11-8 

13-4 

14-6 

16*7 

18-1 

6 

7 

11-8 

13-1 

14-8 

16*0 

17-1 

19-6 

7 

8 

13-0 

14*4 

16-2 

17*4 

18-6 

21-2 

8 ; 

9 

14-2 

15*7 

17*5 

18*8 

20-0 

22-7 

9 ^ 

10 

16-4 

17-0 

18«8 

20*2 

21*4 

24*1 

10 

11 

16*6 

18*2 

20-1 

21*6 

22 ’8 

25-6 

11 

12 

17-8 

19*4 

21-4 

22*8 

24*1 

27-0 

12 

13 

19-0 

20-7 

22*7 

24-1 

25*5 

28*4 

13 

14 

20-1 

21*9 

24*0 

25*5 

26*8 

29*9 

14 

ir , 

21*3 

23*1 

25*2 

26-7 

28*2 

31-2 

15 

10 

22-5 

24*3 

26*5 

28*0 

29*5 

32*0 

10 

17 

23*6 

25-5 

27-8 

29*3 

30*8 

34*0 

17 

18 

24-8 

26-7 

29*0 

30-6 

32*1 

35-4 

18 

19 

25-9 

27*9 

30*2 

31*9 

33*4 

36-7 

19 

20 

27-0 

29*1 

31*5 

33-1 

34-7 

38*0 

20 

21 

28-2 

30*2 

32*7 

34*4 

36*0 

39*4 

21 

22 

29-3 

31*4 

33-9 

35*6 

37-2 

40-7 

22 

23 

30-5 

32*6 

35*1 

36*8 

38'5 

42-0 

23 

24 

31-6 

33*8 

36*3 

38*1 

39*8 

43*3 

24 

25 

32-7 

34*9 

37*5 

39*3 

41*0 

44*6 

25 

26 

33-8 

36*1 

38-7 

40*5 

42*3 

46*9 

26 

27 

35-0 

37*2 

39-9 

41*8 

43-6 

47*2 

27 

28 

36-1 

38*4 

41-1 

43*0 

44*7 

48*5 

28 

29 

37-2 

39*5 

42*3 

44*2 

46*0 

49-8 

29 

.30 

38*3 

40*7 

43-5 

45*4 

47*2 

51*1 

30 

31 

39*4 

41-8 

44*7 

46*6 

48*4 

52*3 

31 

32 

40*0 

43*0 

45*8 

47*8 

49*7 

63*6 

32 

33 

41*7 

44*1 

47*0 

49*0 

50*9 

54*9 

33 

34 

42*8 

45-3 

48*2 

50*2 

52*1 

56*2 

34 

35 

43*9 

40*4 

49*4 

51*4 

53*8 

57*4 

35 

36 

45*0 

47*5 

50*5 

52*6 

54*6 

58*7 

36 

37 

46*1 

48*7 

51*7 

53-8 

55-8 

59*9 

37 

38 

47-2 

49*8 

62*9 

65-0 

57-0 

61-2 

38 

39 

48*3 

50*9 

54*0 

56*2 

58*2 

62*4 

39 

40 

49*4 

52*1 

55*2 

57*4 

69*4 

63*7 

40 

41 

50*5 

53*2 

56*4 

58*5 

60*6 

64*9 

41 

42 

51*6 

54*3 

57*5 

69*7 

61*8 i 

66*1 

42 

43 

52*7 

55*5 

58*7 

60*9 

63*0 

67*4 

43 

44 

53*8 

56*0 

59*8 

62*1 

64*2 

68*6 

44 

45 

54*9 

57*7 

61 *0 

63*2 

65*4 

69*8 

45 

40 

50*0 

58*8 

62*1 

64*4 

66*5 

71-1 

40 

47 

57*1 

59*9 

63*3 

65*6 

67*7 

72*3 

47 

48 

68*2 

61*1 

64*4 

66*8 

68-9 

73-5 

48 

49 

59*3 

02*2 

65*6 

67*9 

70*1 

74*7 

49 

50 

60-4 

63*3 

66*7 

69*1 

71*3 

76-0 

50 


a = dodder content in a sample, 
a confidence coefficient. 
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Calculations of this sort have been performed for six differeut values of the 
confidence coefficient for several values of a or the sample dodder content, and for 
several values of m'a or the upper confidence limit of dodder content per 1 kff. 
These results are presented in Table VI. The use of this table may be illustrated 
in the following example ; 

Example V. Suppose that the confidence coefficient has been fixed at a « *99 
and we are interested in the size of the sample which, if it contains say a « 3 
dodder seeds, would entitle us to state that the true dodder content in the whole 
consignment does not exceed rria = 5 grains per 1 kg. The size of the sample 
required will be found in Table VI in the column corresponding to a « *99, and 
in the row corresponding to a «= 3 and nia ~ 6. This is seen to be 2 kg. If the 
confidence coefficient had been fixed at a = *95, then the amount of seed forming 
the sample could be smaller, namely 1*56 kg. 


TABLE VI. 

Size of Samples to be tested assuring sufficient Accuracy in the Estimate 
of the True Dodder Content per 1 kg. 



a»nutrtber of dodder seeds in a sample. 

W** upper confidence limit of dodder content per 1 kg. 
a confidence coefficient. 
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IV. Practical Goncluaiona. 

We have seen above that if the tolerance limit for the true mean dodder 
content is comparatively low we have to face the fact that the method of testing 
is liable to greatly injure the interests either of the Buyer or of the Seller or both. 
This position is improved if the tolerance limit for the true dt>dder content is high. 
In order to work with high tolerance limits we have either to consider clover seed 
containing larger amounts of dodder as satisfactory, which is impossible, or to work 
with larger samples. 

It appears that it is extremely difficult to give reliable statements accu- 
rately describing the dodder content in seed, if these statements are to apply to 
small quantities of seeds such as a sack, as the size of the sample taken out of 
each sack which would provide sufficiently accurate information would have to be 
so large that the analysis would be too costly. A possible issue seems to consist 
in estimating the dodder content in larger quantities of seed such as 100 sacks. 

If such an amount of secjd is presented and there are grounds to believe that 
it is homogeneous, a seed station may undertake the analysis of the whole. 
Comparatively small samples may be taken out of each sack and analysed separately. 
The data obtained must be used for two purposes: (i) to test the hypothesis that 
the 100 sacks present, in fact, homogeneous seed, and (ii) to estimate the dodder 
content in the seed. If the result of (i) is favourable and the seed is proved to be 
homogeneous there will be only one estimate concerning the whole consignment. 
If however the result of (i) is unfavourable, it will be necessary to estimate the 
dodder content separately for each sack. 

The homogeneity test, mentioned in (i), is carried out in exactly the same way 
as that which led us to Table I B. If the seed proves homogeneous then the single 
estimate of the true mean dodder content will be obtained by combining all 100 
samples from the separate sacks into one large sample. As the sum of a number 
of variables each following the Poisson Law of Frequency itself follows the same 
Law all our tables will be valid to judge the accuracy of this estimate. Practically 
the work will consist in summing up the amount of dodder found in separate 
samples and in referring to Table V to get the upper confidence limit for the mean 
dodder content in the whole consignment per weight equal to the size of the 
100 samples combined. Dividing the upper confidence limit thus obtained by 100 
we shall obtain the confidence limit of mean dodder content per size of one 
sample. 

Obviously, even if the requirements as to the purity of clover seed are veiy 
strict, owing to the large number of partial samples from separate sacks, when 
dealing with the combined estimate we shall have to work with high tolerance 
limits. Thus the interests of both Buyer and Seller will be protected as far as 

It must be emphasised that the final verdict concerning the dodder content 
will apply to the whole stock of 100 sacks of seed. Among those 100 sacks there 



J. PbZYBOEOWSKI and H. WiLKiJsKI 


291 


will be several showing apparently smaller dodder content and others showing 
larger. However, it would not be right to conclude that these sacks differ in reality 
in their mean dodder content, as formerly it has been found that, when analysing 
the distribution of dodder content in samples from single ssu^ks, the variation is no 
more than what should be expected fi'om purely random causes when dealing with 
perfectly homogeneous material. In other words, we shall have to assume that if a 
sample from one sack contained less dodder than that from another, this difference 
is due solely to random variation and not to a real difference in the dodder content 
of the two sacks. 

Assume now another eventuality: namely, that the test showed a marked 
heterogeneity of seed in the 100 sacks presented for analysis. In this case the 
Seed Station should withhold from a general verdict concerning the whole stock, 
as this should be considered as heterogeneous. Separate estimates of the dodder 
content should be given for single sacks. These, of course, being based on the 
analysis of small samples, will be generally rather inaccurate, but we do not think 
there are any means of improving the position. It will perhaps be possible to 
increase the samples from single sacks, but this involves considerable expense. 

Let us illustrate the above procedure by an example. A stock of 100 sacks of 
clover seed has lately been analysed in the seed testing laboratory in our Depart- 
ment. A sample of 100 gm. of seed was taken out of each sack and analysed for 
dodder. Columns 1 and 2 of Table YII state the results of the analysis. It will 
be seen that 76 samples proved to be dodder free, 22 others contained 1 dodder 
seed, etc. According to the principles stated above we tested the hypothesis that 

TABLE VIL 

Distribution of Dodder Content in Samples from 100 Separate Sacks, 


k 


N.Pf, 

0 

76 

77’11 

1 

22 

20-06 

2 

over 2 

2 

0 

SS} 

w*s0*26 

n'-3, x*-0-4641, P*.-0-821 


number of dodder seeds in a sample. 

Aik observed frequency. 

A. expected frequency. 
n' ■■ number of groups. 

— 1 ■» degprees of freedom*. 

* According to the view of B. A. Fisher, E. S. Pearson and J. Neyman the degrees of freedom in 
this case should be n' - 2 and ~ *500. 
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the stock aQalysed was homogeneous. Column 3 of Table VII gives the frequencies 
calculated from the fitted Poisson Series. It will be seen that the fit is almost 
perfect, P= *821 (or *50, if degrees of freedom =1). Thus we can conclude that 
there arc no difficulties in formulating a verdict concerning the whole stock of 
seed as the differences between dodder content in samples from single sacks will 
be ascribed solely to individual random variation. 

In all the 100 samples of 100 gin. each 26 dodder seeds have been found; thus 
the dodder content is 2*6 dodder seeds per kg. Choosing the confidence coefficient 
*99 we find from Table V that the upper confidence limit of the true dodder 
content is 40*5 per 10 kg. or 4*05 per 1 kg. If the agreed tolerance limit for true 
dodder content is, say, 5 dodder seeds per kg., then the tested stock will be con- 
sidered as satisfactory. If, however, the agreed tolerance limit was only 1 dodder seed 
per kg., naturally the stock will be rejected. In Table IV we can find the upper 
bound of the probability of an error in this last judgment. This is lower than *001. 
The Seller will probably not question this judgment and will not propose to 
increase the sample. However, if the tolerance limit is just below the confidence 
limit of the true dodder content, for instance say, 3*5, then perhaps it would be 
worth while to repeat the sampling and base the final verdict on larger data. 

It will be seen that in the procedure which has been recommended above there 
are several points which are not completely specified. We deliberately refrain from 
propounding any definite rules. For instance, such questions as fixing the tolerance 
limits for the true mean dodder content and those of the confidence coefficients 
with which the seed testing stations should work, as well as the question of when 
the fit of the Poisson Series to the distribution of dodder seeds in samples from a 
number of sacks should be considered as showing heterogeneity of the material, etc., 
these should be answered by the Dodder Committee of the International Seed 
Testing Association, assisted by representatives of the seed traders. We think, 
however, that the work of this Committee should follow the methods indicated in 
the present paper. Among other matters it will be necessary to give an adequate 
definition to the term “dodder free” and to revise the meaning of the term 
“absolutely dodder free.” We hope that the results presented above prove 
sufficiently that the present use of these terms may often be misleading. 

I should like to add a word of thanks to Dr J. Neyman for his kind assistance. 



A STUDY OF PREDYNASTIC EGYPTIAN SKULLS FROM 
BADARI BASED ON MEASUREMENTS TAKEN BY MISS 
B. N. STOESSIGER AND PROFESSOR D. E. DERRY. 

By G. M. MORANT, D.Sc. 

1. Introduction. In the season 1924 — 26 an expedition of the British School 
of Archaeology in Egypt, under the direction of Sir Flinders Petrie, excavated 
predynastic sites in the neighbourhood of Badari, 30 miles south of Asyut. The 
remains found could be assigned to an earlier date than any previously discovered 
belonging to the same era. Fifty-nine skulls were preserved and the* majority of 
these were measured by Professor Derry in Cairo. They were subsequently sent by 
Sir Flinders Petrie to Professor Karl Pearson, then Director of the former Biometric 
Laboratory at University College, London, and the study of them by Miss Stoessiger 
(now Mrs Clapham) was published in 1927*. In the seasons 1928 — 30 Mr Bruntoh, 
who in 1924 had worked at Badari, leading a British Museum expedition, excavated 
more human remains associated with the Badari culture. A few of these came from 
Badari, but most are from a neighbouring site near Mostagedda. Eighty-three of 
the skulls were sent to Cairo and measured there by Professor Derry. He kindly 
placed his measurements of both series at the disposal of the present writer and 
those of the 1928 — 30 series arc given in the appended tables. The objects of this 
paper are to compare the descriptive data provided by the two observers and to 
estimate from them the nature and racial affinities of the population represented 
by the two series of crania. 

2. A Comparison of Measurements of the 1924 — 26 Badari Series taken hy 
Miss Stoessiger and Professor Derry. In Miss Stoessiger *s paper individual measure- 
ments are given of 69 skulls from Badari and unpublished measurements taken by 
Professor Derry are available for 63 of these. We thus have an unusual opportunity 
of comparing readings on the same specimens obtained by two observers working 
in different laboratories and following different techniques. Miss Stoessiger used 
the biometric technique without modification, and Professor Derry adopted that of 
the Monaco Congress f with some modifications and additions which he explained 
to the present writer. The measurements which are defined in these two schemes 
in ways which may be expected to give closely similar or identical results are the 

* "A Study of the Badarian Crania recently excavated by the British School of Archaeology in 
Egypt.” Biometrika, Vol. xxx. pp. 110 — 160. 

t He refers to the English translation of the Monaco report published by Dr W. L. H. Duckworth in 
1918 as: Intemational Agreements for the Unification {a) of Craniometric and Cephalometric Meaeuremente ^ 
{b) of Anthropometric Measurements to be made on the Living Subject (Cambridge University Press), 
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following (the biometric index letters being given first in the brackets, followed by 
the numbers of the Monaco report) : 

(i) Maximum calvarial length from the glabella in the median sagittal plane (X, 1). 

(ii) Maximum calvarial breadth (J?, 3). The two definitions differ since the 

biometric B is the maximum transverse diameter on the parietal bones, 
while the Monaco breadth is the maximum transverse whether found on 
the parietal bones or not. For Egyptian crania, however, the maximum 
breadth obtainable falls almost invariably on the parietal bones. 

(iii) Minimum frontal breadth (B\ 6). 

(iv) Basio-bregmatic height (XT', 4a). 

(v) Maximum horizontal circumference above the supra-orbital ridges and passing 

through the ophryon (U, 23 a). 

(vi) Minimum arc from nasion to bregma (/Si). 

(vii) Minimum arc from bregma to lambda (/S 2 ). 

(viii) Minimum arc from lambda to opisthion (/Sa). 

(ix) Total sagittal arc from nasion to opisthion (/S, 22). 

(x) Maximum bizygomatic breadth (X, 8). 

(xi) Breadth between lowest points on zygomatico-maxillary sutures {QB), 

(xii) Chord from nasion to basion (XX, 9), 

(xiii) Chord from nasion to alveolar point (G'Hy 12). 

(xiv) Nasal height from nasion to point furthest removed from it on the margin 

of the left pyriform aperture (NHy X, Professor Derry took this — the 
Frankfurt — nasal height and not the Monaco measurement 13). 

(xv) Maximum breadth of the pyriform aperture {NB, 14), 

(xvi) Breadth of right orbit from the dacryon (Oi',X, 16). Professor Derry writes: 

“Where the lacrymal bone is broken, or missing, I use the angle formed 
by the union of frontal and frontal process of the maxilla, which I believe is 
more easily determined, and therefore more exact, than many other so-called 
‘points’ on a skull.” The biometric practice is to omit the measurement if 
the position of the dacryon is uncertain owing to defect of the lactymal 
bone. Professor Derry gives the orbital breadth on both sides, and Miss 
Stoessiger that on the right side only. 

(xvii) Maximum heights of orbits perpendicular to their breadths (0a,i2 and 
02,X, 17). 

(xviii) Profile angle, i.e. the angle between the line joining nasion to alveolar 
point and the Frankfurt horizontal plane (PZ.). 

(xix) Bicondylar breadth of the mandible (w, 25). 

(xx) Symphyseal height of the mandible, i.e. the maximum chord from the tip of 

the process between the central incisors (intradental) to the lower border of 
the bone in the symphyseal plane (Ai, 29). 
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(xxi) Minimum breadth of the left ascending ramus of the mandible (r6^ 28 a). 

It is not suggested that the biometric and Monaco definitions of all the 
above measurements accord exactly, and, indeed, those of the latter scheme 
are not full enough in many cases to decide this point at all definitely. 
It may be hoped, however, that the two sets of readings will be found in 
close accordance for these 21 characters. Agreement would not be expected 
in the case of the following measurements owing to difierences in definition. 

(xxii) Auricular height. Professor Derry writes: “I take this with the skull on 
the craniophor and with the loTver margin of the left orbit on the same 
horizontal plane as the upper margin of the left auditoiy meatus. At times 
this is the same as the upper surface of the supports in the meatus, but 
more often it is above these owing to the slope of the bony roof of the 
meatus. In any case I place the point of the scriber against the margin of 
the meatus and then move it to the orbit and rotate the skull until the 
lower edge is at the same level. The sliding arm on the craniophor scale 
is then brought down until it just touches the skull lightly.’* The biometric 
practice is to make contact between the ear-rods of the craniophor and the 
poria in all cases and hence measurements obtained in the two ways cannot 
be expected to agree exactly. For the six comparisons which can be made 
Professor Derry’s reading is greater than Miss Stoessiger’s Off in all cases 
and the differences are between 1*4 and 3*2 mm. 

(xxiii) Transverse cranial arc between "the most prominent point on the posterior 
root of the left zygoma, exactly above the auditory aperture” and the same 
point on the right passing through the bregma (23). This must always be 
considerably less than the biometric vertical arc from porion to porioii (Q') 
and in the case of five skulls for which both are available the differences 
range from 11*6 to 18*0. 

(xxiv) Chord from basion to prosthion (10). This must be greater, almost in- 
variably, than the chord fiom basion to alveolar point, defined to be the 
lowest point on the process between the central incisors {OL). Comparisons 
between the two measurements can be made in the case of 47 skulls : 41 
show differences of the sign expected (the greatest being 3*5), two show 
zero differences and the remaining four differences of the opposite sign (the 
greatest being 1*2). 

(xxv) Interorbital breadth. Professor Derry writes ; " This is the actual width 
between the medial orbital walls when the two arms of the callipers are 
placed within the orbits and in contact with the medial walls.” His defini- 
tion differs from that of the Monaco technique (15) and from the biometric 
breadth between the dacrya (DC), Comparison with the latter can be made 
in six cases and the largest difference found is 1*3. 

(xxvi) Bigonial breadth (26). According to the Monaco definition this measurement 
is taken on the external surface of the jaw between the bilateral points "at 
which the angle is formed between the lower margin of the body of the 
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jaw and the posterior border of its ascending ramus.” This is practically the 
same definition as that of the biometric measurement which has been 
found to give unsatisfactory results in practice. The three differences 
(D. — S.) available are — 1*0, — 1*2 and + 2*3. 

(xxvii) Maximum breadth of ramus (28 6). There is no biometric measurement 
similar to this. 

(xxviii) Mandibular angle (32). This and the two following mandibular measure- 
ments were taken by Professor Derry on a mandible board, which he has 
described*, and which is precisely similar in construction to the instrument 
used by Miss Stoessiger. But the adjustment of the bone on the board was 
not the same in the two cases and hence the measurements are not com- 
parable. The few differences available confirm this supposition. Professor 
Derry 8 mandibular angle is similar to the biometric if Z . 

(xxix) Projective length of corpus : similar to biometric Cyl, 

(xxx) Projective length of ramus: similar to biometric rZ. 

The differences between the readings of the two observers for the same skulls 
may now be examined, in the case of the first 21 of the above measurements, to 
test whether the agreement is sufficiently close to be of no account in making 
comparisons with other scries. Data are given in Table I for combined male and 
female specimens. Professor Derry took eight of the measurements in question on 
a few only of the Badari 1924 — 25 skulls which are sufficiently complete to 
provide readings for them. The information regarding personal equation available 
for these eight measurements in the table is obviously inadequate, fewer than 10 
differences being given for each. As far as they go, however, the constants indicate 
that there is a satisfactory agreement for all except PZ and The definitions 
of the profile angle used by the two observers were the same, but there is reason 
to believe (sec under Auricular height, measurement (xxii) above) that the adjust- 
ment of the skull in the craniophor was not made in the same way. This probably 
accounts for the rather large differences found and it will be safest to conclude 
that Professor Derry's profile angles are not comparable with Miss Stoessiger’s. 
The same conclusion must be accepted in the case of the bicondylar breadth of the 
mandible (wi). Much larger numbers of differences are available for the remaining 
14 measurements — counting O 2 , R and O 2 , £ as different measurements — in Table I. 
Several of the maximum individual tlifferences in column 3 are considerably larger 
than those usually found between readings given by two workers in the same 
laboratory, and only four of the mean differences in the next column can be 
considered to differ insignificantly from zero (i.e. by less than three times their 
probable errors). The absolute sizes of the mean differences are of more importance, 
however, than the significance of their divergences from zero. They might be 
usefully compared with the probable errors of the means for the series, but these 
are as yet unknown. An approximation to them can be made by supposing that 


“A New Mylometer,^* Journal of Anatomy and Physiology, Vol. XLvm. (1914), pp. 480 — 481. 
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the standard deviations for the Badari male skulls are the same as for longest 
Egyptian series available, viz. the E series of 26th — 30th Dynasty skulls from 
Qizeh described by Professor Karl Pearson and Miss A. Q. Davin*. The male 
Egyptian E standard deviations are in column 6 of the table and by using them 
the presumed probable errors of means for 35 Badari skulls are obtained and 
these are given in the last column. The numbers on which Miss Stoessiger*s male 
means of cranial measurements of the Badari series are based actually range 
from 32 to 86. A comparison of the constants in columns 4 and 7 of Table I shows 


TABLE I. 

Data relating to the Differences between Miss Stoessigers {S,) and Professor Derrffs 
(2>.) Measurements of the 1924 — 26 Badari skulls. 


Character 

No. 

of com- 
parisons 

Maximum 

diiferenoe 

(D.^S.) 

Mean 

difference 

(D.~S.) 

Standard 
deviation of 
differences 

Male of 

Egyptian E 
series 

Presumed 
probable 
error of mean 
for 35 skulls 

L 

ry.i 

-2*0 

-•32 ±*060 

0-65 + *043 

5*72 ±*09 

•65 

B 

52 

- 3*5 

+ •02 ±*068 

0-73 ± 048 

4*76+ *08 

•54 

R 

53 

+ 2-5 

+ -40+ -067 

0-72 ±*047 

4*05 ± -06 

•46 

W 

52 

+ 3*0 

- -34 ±-065 

0*70 ± *046 

5*03+ -OSt 

•67 

U 

53 

H-6*5 

+ 1*29 ±-200 

216±-142 

13-77 ±-22 

1-57 

Si 

8 

-1-0 

-•06 

— 


— 

S 2 

8 

-f -2-0 

+ •13 

— 


— 

S‘^ 

8 

- 2-0 

- -37 

— 

— 



8 

-2-5 

-*31 

— 

— 

— 

J 

33 

+ 2*5 

-•03+ 087 

0*74 ±-061 

4-57 +*08 

•52 

OB 

45 

-5*4 

-•43+ -145 

l-44±-102 

4*67 ±*08 

•53 

LB 

53 

+ - 1*0 

-•15 ±-036 

0*39 ±026 

3*97 ±*06 ] 

•45 

a 'II 

47 

- 2*5 

-•79±064 

0-65 ± *045 

4-15+ -07 

•47 

NH, L 

47 

- 2-3 

-*64+ -064 

0-65 ±-045 

2-92 ±-06 

•33 

NB 

47 

-2*1 

+ •07 +*053 

0-54 ± -038 

1-77 + -03 

-20 

Oi\ It 

46 

-2*9 

-•76 ±*075 

0*75 + *053 

1 -67 ± ‘OSt 

•19 

li 

45 

+ 3-2 

- -23+ *071 

0-71 + *050 

1-91 + -03 

•22 

Oa, L 1 

44 

--1*4 

-•49+ -050 

0*49 ±'035 

1-88 ±-03 

•21 

Vjl 

« 

- 2" *5 

-*’•58 

— , 




Wi 

3 

+ 2-8 

+ 1*37 

— 

— 

— 

hi 

2 

-0*4 

- 10 

— 

— 



rb' 

5 

+ 0*2 

i 

+ •02 

1 

-J 

— 


that the ratio of the mean difference between the readings of the two observers to 
the presumed probable error of the male mean for the series ranges from *04 to 
4*0 (!). If the former constant is less than 0*7 of the latter the effect of personal 
equation on mean values may be considered negligible, and this is so for X, B, H \ 
X, LB and NB\ if the ratio is between 0*7 and 1*4 {B\ OB and O^^R) the 
"errors” will affect the comparisons appreciably but probably not render them 

* ‘*Od the Biometrio Constants of the Human Skull.” Biometrika^ Vol. xvi. (1924), pp. 828 — S63. 
f For the vertical height from the basion (H) in place of the basio-bregmatic height {H'), 

X For the “curvature” (Oj, B) in place of the dacryal (O/, JR) orbital breadth. 
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misleading; if the ratio is greater than 1*4 (O'H, NH,L, and Ot^L) it may 

be supposed quite unsafe to neglect the consideration of personal equation in 
comparing means given by the two observers for a series made up by approximately 
35 specimens. As measurements of variability will also be wanted for the series, 
it is necessary to consider the standard deviations, as well as the means, of the 
distributions of differences between the two sets of readings for the same skulls. 
These are given in Column 5 of Table I and they may be compared with the 
Egyptian E male standard deviations in the following column. If the former 
constant is less than 0’2 of the latter in the case of a particular character the 
agreement may be supposed satisfactory, and this is so for Z, Z', H\ U, t/, LB 
and O^H\ if the ratio is between 0*2 and 0*4 {OB, NH,L, NB, 0%,R and 0*,Z) 
the agreement is less satisfactoiy, and if greater than 0*4 (Oi\R) decidedly unsatis- 
factory. By applying these arbitrary tests to the means and standard deviations 
of the differences between Miss Stoessiger's and Professor Derry’s readings on the 
same skulls, it is possible to group the measurements for which they used similar 
or identical definitions in the following way : 

(a) those for which agreement is sufficiently close to make it possible to neglect 
the effect of personal equation — L, B, H', J, LB, and probably S 2 , Ss, S, hi and 
rb', though few data for these last six from which estimates of their accuracy can 
be obtained are available ; 

(b) those for which agreement is less satisfactory, so that there is a danger of 
personal equation affecting comparisons between constants for series appreciably, 
though not to a marked extent — B', U, OB, NB, O^tR', 

(c) those for which comparisons between the constants will be definitely invalid 
owing to personal equation — O^H, NH,L, Oi\R, and 02,L, and probably P/L and 
Wi, though few data for these last two are available. 

There can be no doubt that personal equation would be of much less account 
if measurements of two workers in the former Biometric Laboratory were being 
compared than in the case of the present comparison, but this might have been 
anticipated. 

3. The Sexing of the Badari Series, Three estimates of the sexes of the 
adult 1924 — 26 Badari skulls are available. The first of these to be published 
is that of Miss Stoessiger s paper in which it is said that the specimens were sexed 
by Professor Karl Pearson and the present writer. There are 36 males and 22 
females distinguished, and 53 of these have been sexed by Professor Derry. He is 
willing to accept the determination given in the paper in Biometrika in 47 cases, 
but differences must be recorded for the remaining six which are : Nos. 6394, 
S. c/*?, D. ? ; 5436, S. 5438, S. c/*. D. ? ; 6743, S. </, D. ? ? ; 6004, S. </*, 

D. J ; 5383, S. ? , D. The third sexing of the series is that given in the report 
on the excavations* from which the following is quoted (pp. 19 — 20): 

* Tlie Badarian Civilisation and PredynofUc Remains near Badari, By Guy Brunton and Gertrade 
Caton-Thompson, BritiBb School of Archaeology in Egypt and Egyptian Hesearch Account, Thirtieth 
Year, 1924. London, 1028. 
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** There was no sign that the bodies had been preserved in any way. The skin 
was sometimes still visible, occasionally well preserved, but the internal organs hod 
always disappeared.... As regards segregation of sexes in cemeteries, a curious fact 
emerges. Female graves were not placed apart ; where there are graves of women, 
there are also graves of men alongside. Certain areas, however, were reserved for 
men.... The most easily observed feature was the hair.... It is interesting to note 
that in no case could a beard or moustache be detected.” 

The method of sexing used is not described, but it appears from the remarks 
on individual skeletons that the length and condition of the hair was taken as 
one criterion, and the nature of the grove furniture also gave some guidance 
presumably. Sexes are given in the report (A.R.) for 42 of the skulls sexed both 
in Miss Stoessiger's paper (S.) and by Professor Derry (D.). It is found that for 
24 of these, all three estimates agree ; for 30, A.R. agrees with S. ; for 35, A.R. agrees 
with D., and for seven A.R. differs from both S. and D. The last seven are Nos. 5411, 
A.R. ^ ** long wavy black hair arranged in twisted tresses ” ; 5745, A.R. ^ “ hair 
thick, black and wavy, bracelet ” ; 5815, A.R. ? “ dark, slightly wavy hair ” ; 5350, 
A.R. </*; 5360, A.R. 5368, A.R. ; 5370, A.R. “in male cemeteiy.” A re- 
examination of these seven skulls suggests that an anatomist would have little 
hesitation in assigning sexes to them opposite to those given in the report, in spite 
of the fact that the sexing of the Badari skulls is particularly diflScult owing to 
the fact that the type is a feeble one showing little muscular development. It 
is unfortunate that there is no closer agreement between the three estimates, 
and the effect of the discordance on means must at least be examined. Of the 
measurements which have been found to be least affected by personal equation, 
the three major calvarial diameters are the ones most suitable to use for this 
purpose and means for them are given in the following table, computed only 
for the 42 skulls for which all three estimates of sex are available and using 
Miss Stoessiger’s measurements in every case. 


Sexing 

Bex 

L 

B 

H'* 

Stoessiger ... 

Derry 

Archaoologioal Report 

s 

183*4 (27) 
183*6 (23) 
182*2 (24) 

131*2 (27) 
131*9 (23) 
131*6 (24) 

134*0 (26) 
134*6 (22) 
133-6 (23) 

Stoessiger 

Derry 

Archaeological Re|>ort 

9 

177*2 (16) 
178*3 (19) 
179*7 (18) 

131*1 (16) 
130*2 (19) 
130*6 (18) 

128*9 (16) 
129*4 (19) 
130*3 (18) 


The means are seen to be in better accordance than might have been anti- 
cipated. For L and H' those given by Miss Stoessiger and Professor Derry agree 
better than either set does with those based on the sexing in the Archaeological 


If' is not given for one skull which is male according to all three estimates. 
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Report, but for B the last two are in closest agreement. In the case of two of the 
estimates, a similar comparison can be made based on 53 skulls, and the means are: 


Sexing 

Sex 

I. 

B* 

H'f 

Stoessiger 
Derry . . . 

6 

182-2 (33) 
182-2 (21); 

130- 0 (33) 

131- 5 (20) 

133-1 (32) 
133-4 (28) 

Stoessiger 
Derry ... 

9 

177-0 (20) 
177-9 (24) 

i;i0-2 (19) 
129-6 (23) 

128- 9 (20) 

1 29- 3 (24) 


The pairs of means here are actually all closer together than the corresponding 
ones based on the 42 skulls. We are comparing two anatomical estimates of sex 
which almost certainly differ owing to the interchange of a few doubtfully male 
or doubtfully female specimens, and the way in which these are assigned does not 
appear to influence the means greatly. But still the greatest observed differences 
between the means are of an order which would have indicated unreliable measure- 
ments if they had been found as mean differences between the readings given by 
two different observers (cf. data in Table I). The personal equation in sexing is 
hence a factor which is likely to affect comparisons between series appreciably. 
The sexes given in Miss Stoessiger’s paper were accepted in making the com- 
parisons in the following section of the present paper since they are given for all 
the specimens, while the other two records only give sexes for parts of the total 
1924 — 25 series. It is not suggested that the former sexing is more accurate than 
the others when available. For the 1928 — 30 series, we have only Professor Derry’s 
determinations to follow. 

4. A Compiirison of the 1924 — 25 and 1928 — 30 Badari Series. Constants 
are given in Table II based on Miss Stoessiger’s measurements of the 1924 — 25 
series and Professor Derry’s of the 1928 — 30 series. The data there only relate 
to those characters for which it has been shown that comparisons between the 
measurements of the two observers arc not likely to be vitiated to a marked extent 
owing to their personal equations J. Standard deviations have not been calculated 
for the series excavated on the two occasions, since it is clear that their means are 
in close agreement, but a non-metrical comparison of their distributions suggests 
that they have approximately the same variability as one another and as the 
combined siimple in the case of every character. The standard deviations of the 
hist were calculated and the probable errors of the means are given (= p.e. Mp , 
say). The probable errors of the means of the two component samples will certainly 

* 1? is not given for one skull which is female according to both estimates. 

+ H' is not given for one skull which is male according to both estimates. 

X All such are included in Table II (see p. 302 below) except the height of the right orbit (Og, Ji). As 
the height of the left orbit was found to be an unreliable measurement, it appeared safer to class 
with those of that kind and data for it are given in Table III. 
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be larger than these, but their differences may be compared with V2 x (p*e. Mp)^ to 
give an estimate of whether they are significant or not. The difference divided by 
this quantity is greater than 3 in the case of the male 100 H* JL (3*4), (3*6) and 

J (3*9), and the female S (3T), (3*3), NB (3 8), 100 (3*9), OB (4 5), B 

100 H^fL (4*7) and L (4*9). The ratio is greatest for the last comparison, and if it 
is assumed that for this the standard deviations of the component distributions 
are the same as for the pooled distribution — which is a reasonable assumption — 
then the female mean L for the 1924 — 26 series will be 176*7 ± *52 and for the 
1928 — 30 174*4 ± *43. The difference between these is 3*4 times its probable 
error. It will be safe to suppose that the differences between the mean measure- 
ments of the two separate series are all insignificant, except possibly in the case 
of the female values for Z, B, OB and 100 H* jL, and no clearly significant differ- 
ences are found for these. The conclusion that the Badari series excavated on 
the two occasions represent the same racial type appears to be sufficiently justified 
and hence they have been pooled. 

This conclusion might be invalidated if clearly significant differences were 
found for any character not in Table II. Table III gives the means for those 
characters for which identical or very similar definitions were used by the two 
observers, but for which their readings on the same skulls were found to differ 
appreciably or markedly. It will not be legitimate to pool data for these characters. 
The ‘"divergences expected” in the last column of the table are those actually 
found (see Table I) between the mean measurements of the two observers for the 
same skulls. The constants for the 1924 — 25 and 1928 — 30 series in Table III 
are still in close agreement, and it will be safe to assume that none indicate 
possible difterentiation of the types — paying due regard to the divergences 
expected owing to personal equation in comparing the means — except possibly 
in the case of the upper facial {O'H) and nasal {NH,L) heights and indices, 
and in the case of the bicondylar breadth of the mandible (t^^i). The divergence 
expected in the case of the last character is oply based on the comparison of 
measurements of three specimens, and hence it is quite likely to be misleading, 
and Miss Stoessiger’s and Professor Derry’s readings may be directly comparable. 
But the measures of personal equation in the case of the other characters which 
are showing larger differences than would have been anticipated are based on 
adequate numbers of comparisons. The standard deviations were calculated for 
the two series separately giving the results provided in Table III, p. 303, the means 
for the 1928 — 30 series being given values which they would presumably have had 
if measured by Miss Stoessiger. 

The difference between the male means for O'H is here 3*4 times its probable 
error and the difference between the female means for NHyL is 3*8 times its 
probable error, while the other two differences are quite insignificant. It appears 
that the 1928 — 30 type had a significantly higher face than the 1924 — 25 type, 
though part of the observed difference may possibly be due to the fact that the 
series were sexed by different observers. The discordance is not sufficiently marked 
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Oharaoter 

Means 

Standard Deyiations 


9 

i 

9 

1924—25 

1938—80 

1924—25 

1928-80 

1924—25 

1928—80 

1924—25 

1928—80', 

O'B 

NH,L 

67*1 ±-46 
(34) 

48'4±-32 

(34) 

[60-3l±*46 

r (1®) 

[40-4] ±*36 
(43) 

64-8 ±-63 
(20) 

46 -6 ±-42 
(20) 

[66 -oil -33 

m 

[47-41 + -23 
(34) 

3-95 ±-32 

2*78 ±'23 

4-41 ±-32 

3-47 ±-26 

4-18±-45 

2-81 ±*30 

2*83 ±-23 

1-99 ±'16 


TABLE HI. 

Mean Measurements of two Badari series, from Readings hy Miss Stoessiger (S.) 
and Professor Derry (D.), between which Agreement is not eospected owing to 
Personal Equation, 


Character 

s 

9 

Divergence 

expected 

(S.-D.) 

1924—25 (8.) 

1928—80 (D.) 

1924—25 (S.) 

1928—80 (D.) 

O^H 

67’1 (34) 

68*5 (43) 

64*8 (20) 

65*2 (34) 

•f *8 

NH,L 

48-4 (34) 

48*8 (43) 

46-6 (20) 

46*8 (34) 

+ •6 


38-4 (33) 

38-3 (38) 

37*6 (20) 

37-2 (32) 

+ •8 

0\,L 

— 

37-4 (37) 

— 

36*3 (32) 

— 

O^R 

32-0 (34) 

31*5 (38) 

31*3 (21) 

32-2 (32) 

+ •2 

0%,L 

32-1 (33) 

31*6 (37) 

31*4 (18) 

31-9 (33) 

4* ‘5 

Pl 

84“-0 (33) 

82“-2 (41) 

83®-3 (20) 

8r-2 (33) 

+0®*6? 

100 Q'HIQB 

70*9 (34) 

72-6 (41) 

71*7 (17) 

70*5 (33) 

— 

imNBINH,L 

51-6 (34) 

50-9 (42) 

61*8 (20) 

51*9 (34) 

— 

100 O^IOi^R 

83*3 (33) 

82-2 (37) 

83*4 <20) 

86*6 (32) 

— 

100 0^l0i\L 

— 

84-7 (37) 

— 

87*8 (32) 

— 

Wi 

1 109*5 (30) 

109-9 (36) 

105*2 (16) 

104*1 (30) 

- 1*4? 


to render the pooling of the series unjustifiable. Table IV gives the means for the 
remaining measurements taken by Professor Derry on the 1928 — 30 series and for 
similar, but differently defined, measurements taken on the 1924-- 25 series by 
Miss Stoessiger. It is clear that entirely fallacious conclusions would have been 
reached if some of these corresponding pairs had been supposed comparable. 

5. The Nature of the Badari Series, Certain features of the Badari series can 
be estimated from the data given in Tables II — IV above. The characters for 
which pooled means are given (Table 11) have constants based on fairly adequate 
numbers of skulls and the sex ratios (male mean/female mean) for the absolute 
measurements are found to be unusually small, though of the same order as those 
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TABLE IV. 


Mean Measurements of two Badari Series for coi'responding 
hut differently defined Characters, 


Prof. Derry’w 
moa»urementu 


1928—30 (Derry) 


<$ 


¥ 


Corresponding 
but differently 
defined 
biometric 
measurements 


Monaco traiisvorHe arc ... 
Auricular heigiit 
Choixl baa.'prosth. 

1 1 ) terorbital broa< Ith 
101) (Jhord baa. - pros til. /X/i 
Bigoiiial breadth 
Max. breadtli of ramus ... 
Length of corpus 
Length of ram us ... 

M aiidibular angle 


206*1 (41) 
115*7 (41) 
9S*r> (40) 
21 *7 (40) 
97*4 (40) 
91 *6 (40) 
42*4 (39) 
79*9 (40) 
61*1 (38) 
117‘’*7 (41) 


283-0 (32) 
111-8 (33) 
94-2 (33) 
21-1 (32) 
97-7 (33) 
85-8 (31) 
40-2 (33) 
75-3 (33) 
55-0 033) 
12r-0 (33) 




OJJ 

GL 

DC 


w.. 



Ml. 


1924 — 25 (Stoeasiger) 




9 


302-0 (34) 
111*0 (34) 
95*0 (.33) 
22*4 (32) 


298*8 (20) 
108*6 (21) 
92*6 (20) 
20*6 ( 21 ) 


88-8 (32) 


86*2 ( 21 ) 


76*2 (33) 
57*6 (33) 
120"*0 (34) 


74*1 (19) 
53-2 (18) 
J23 ’*3 (19) 


observed for other series*. Owing to the difficulty of sexing the Badari series, 
it would be unwise to attach much importance to this result. The mean cephalic 
and breadth-height indices are about one unit less for the male than for the female 
series, as is usually found. Other expected relations arc that the female orbital 
index (Table III) and mandibular angle (Table IV) are greater than the male 
values. It is probable that the <lifferences between the means for the two sexes 
in the case of all the other indices and of the profile angle are quite insignificant. 
There is every reason to believe that the male and female series represent the 
same racial type. 

Sexual comparisons in variability can only be made in the case of the 18 
characters dealt with in Table II. If coefficients of variation are used for the 
absolute measurements and standard deviations for the indices, it is found that 
for 15 of these the male variability is the greater. Dividing the difference of the 
constants by its probable error gives values over 3 in the case of S (3T), H' (3*3), 
Sx (3*4), (3*4), L (5*3) and U (5*6), while for all of these it is the male vari- 

ability which is the greater. A similar clear preponderance of male over female 
variability has been observed in the case of the long Egyptian B series of the 
26th — 30th Dynasty skulls, but for other Egyptian and non-Egyptian series it 
has been far more usual to find a closer approach to sexual equality in this respect. 
The material suggests that there are racial differences between sexual differences 
in variability and the Badari type appears to be close to one extreme of the range. 
It is possible that inaccuracies in sexing have favoured the last conclusion. 

* Sex ratios for seven calvarial measurements are given in Bionietrika, Vol. xxv. (1933), p. 269 in 
the oase of the Kerma, the E Egyptian and the Teita negro scrios. The Badari values for each oharaoter 
are smaller than any shown there, though the difference between the Badari and the next smallest 
sox ratio is small in every oase, its maximum value being *009. 
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A Gomparison of the variability of the Badari and other series is of particular 
interest since it is believed that we are dealing with the earliest cranial sample 
of any length which has been obtained from any part of the world*. It is implied, 
or distinctly stated, in a good deal of the literature of physical anthropology that 
present-day populations are racially far more heterogeneous, and hence far more 
variable, than those which existed at the beginning of the Christian era, say, and 
that by going back further in time we must approach closer and closer to that 
ideal and undefined population which may be called a ** pure race. There appears 
to be no valid reason for attempting to divide the Badari series into sub- samples 
which would show lesser variabilities than the total sample. It is reasonable to 
assume that the people represented formed part of an intermarrying group which 
could not be divided into racial components owing to their complete inter-mixture 
for several generations. The biometrician accepts such a group as representing a 
racially homogeneous type, and comparisons may be made with other samples 
presumed to have been drawn at random from similarly constituted populations. 
The Badari variabilities for the 16 cranial characters in Table II (omitting the 
mandibular measurements hi and r6') are compared below with those of the long 
Eg 3 rptian JS series, using coefficients of variation for the absolute measurements 
and standard deviations for the indices : 

Male. — Badari variabilities greater 11 characters, Egyptian E greater 6 ; 
differences exceeding three times their probable errors B (3*2, Badari greater). 

Female. — Badari variabilities greater 7 characters, Egyptian E greater 9 ; 
differences exceeding three times their probable errors 100 H'jL (3*2), L (4*2), 
Sz (4*6), U (4*8), 8i (5*6), the Egyptian E being the greater in these five cases. 

It appears that the male Badari series is slightly more variable than the 
male Egyptian but for the females . the position is reversed and the difference 
is more significant. The Egyptian E has been compared with several other cranial 
series in a similar way, and it has been found that its variability is slightly less 
than those of samples from different parts of the world which are presumed to 
represent racially homogeneous populations. The following comparison is between 
the Badari constants and those given for the Farringdon Street English series which 
was obtained from a single London cemeteiy used in the seventeenth century j*, no 
selection of specimens on which measurements could be taken having been made ; 

Male. — Badari variabilities greater 4 characters, Farringdon Street greater 12 ; 
differences exceeding three times their probable errors OB (3*1, Farringdon Street 
greater). 

* Since the later discovery of Tasian remains, the Badari is generally considered to be the second 
oldest culture of predynastio Egypt known. If the beginning of dynastic times is dated at 8000 d.c. and 
the length of the predynastio period is estimated as a minimum of 2000 years, then the Badari people 
must have lived as long ago as 4000 b.c. and probably earlier. Most authorities apparently agree in 
making them ante-date all European neolithic cultures by a considerable period. See, for example. 
Miles Burkitt and V. Gordon Childe: “A Chronological Table of Prehistory.*’ Antiquity^ Vol. vi. (1932), 
pp. 185—205. 

t Beatrix O. E. Hooke, <'A Third Study of the English Skull with special reference to the Farringdon 
Street Crania.” Biometrika, Vol. xvm. (1926), pp. 1 — 55. 
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Female. — Badari variabilities greater 2 characters, Farringdon Street greater 14; 
differences exceeding three times their probable errors LB (3*4), NB (3*6), (4*3), 

H' (4*8), 8 (5-3), 100 H^jL (5*6), (7*6), U (7*7), L (8*8), the Farringdon Street 

being the greater in these nine cases. 

While there appears to be no marked difference in variability between the two 
male series, the Farringdon Street female must be considered to be very signifi- 
cantly more variable than the Badari female series. The last conclusion is likely 
to convey a false impression, however, since it may suggest that the one series is 
much more variable than the other. To obtain some measure of the order of the 
difference, the Farringdon Street constant was divided by the Badari constant in 
the case of each character, and then the average of these ratios was found for the 
16 characters. This was also done in the case of the other three comparisons above 
and the results are : with Egyptian Ey male 0*952, female 1*073 ; with Farringdon 
Street, male 1*080, female 1*281. Judging in this way, the English female variability 
is thus of the order 1*3 times the Badari female variability, which is an extra- 
ordinarily low ratio, since we are here dealing with series accepted as representing 
racially homogeneous types which show a divergence in variability approaching 
the maximum found among such series *. In the last 6000 years there appears 
to have been little change in the variability of racial populations, and it is safe to 
conclude, perhaps, that ** pure ” types much less variable than any found to-day 
have only existed in that period in the imaginations of some anthropologists. 

6. Racial Comparisons, Mean cranial measurements based on 30 or more 
male specimens have been collected for 150 series representing racial types from 
different parts of the world, excluding many available for ancient Egypt and 
Europe to prevent these regions being over-represented. It is found that the 
Badari means are quite close to the inter-racial averages (i.e. the unweighted 
averages of the serial means), except in the case of: B which is small, though 
12 out of 150 means are smaller; B' small, but 7 out of 126 smaller; J small, but 
2 out of 141 smaller; U small, but 14 out of 116 smaller; 100 BIL low, but 17 out 
of 150 lower; 100 BIH' low, but 15 out of 148 lower; EJL, L small, but 19 out of 
144 smaller ; O 2 small, but 4 out of 148 smaller (than Stoessiger^s mean 31*5) ; 
^2 large, but 17 out of 98 larger. The Badari type is thus seen to have cal- 
varial and facial breadths which are unusually, but not extremely, small, while its 
calvarial antero-posterior lengths (apart from the 8% arc from bregma to lambda) 
and heights are not at all exceptional. The horizontal circumference (27) is hence 

* There appears to be only one cranial series, for which the constants have been published, which 
is appreciably less variable than the Badari and Egyptian E, This is one of Guanche skulls measured 
by Prof. E. A. Hooton (“The Ancient Inhabitants of the Canary Islands.’* Harvard African Sttidies, 
Vol. vTi. 1925). Data for 12 of the 16 cranial characters in Table II above are given. Dividing the 
Tenerife constants — coefficients of variation for absolute measurements and standard deviations for 
indices — by the Badari values gives for these a mean male ratio 0*914 and a mean female 1*040. 
It is not surprising to find that the population of a small island shows peculiarly gmaU variation since 
inbreeding must have b^n common. Most European samples — whether derived from single cemeteries 

or from several cemeteries believed to have been used by closely related groups of people show 

variabilitos which are close to the Farringdon Street values. 
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small and the cephalic index low. Capacities could not be found directly owing^ to 
the fragility of the skulls, but the average for them must be unusually small. The 
^ial skeleton has unusually small heights. Comparative material for the man- 
dible is scanty, but the Badari bicondylar breadth (ii;x) appears to be the smallest 
male mean recorded, thus agreeing with the conclusion that the breadths of the 
cranium are unusually small. Comparisons between the Badari means for single 
characters and those available for other ancient Egyptian series have been made 
by Miss Stoessiger*, and it has been shown that the Badari type diverges 
slightly from them in being more prognathous and in having a higher nasal index. 
In these respects it is more negroid. 

The racial relationships of the Badari type can be judged better from coeffi- 
cients of racial likeness. There are 27 reduced values, of which several have been 
previously published, in Table V below and the male means gfiven by Miss Stoes- 
siger were used in computing all these constants. New means based on larger 
numbers of skulls are now available for 12 of the characters used in computing 
the coefficients, but these are all very close to Miss Stoessiger's means, and by 
using hers in all cases we have all 31 means based on approximately the same 
numbers which is an advantage. As would have been anticipated, the closest 
connections are found with other Predynastic Egyptian series. The dynastic series 
are further removed, and types more similar to the Badari than some of them 
are found in Abyssinia to-day and also among the criminal classes in Cairo (Sydney 
Smith). All other African types with which comparisons have been made are 
more widely removed from the Badari t 3 rpe. Miss Stoessiger found that there is an 
extraordinarily close similarity between the last and some Indian types of cranium, 
and the revised coefficients in Table V confirm her conclusion. A comparison of 
the reduced coefficients there with several hundred others, published and un- 
published, suggests the following conclusions : 

(i) The Badari type resembles most closely those of the other Predynastic 
Egyptian series available. Its connections with these are as close as those normally 
found between different samples representing the populations of the same country 
at different times, but in the same era. None are so close that the Badari and 
any other pred 3 mastic series can be supposed to represent precisely the same 
population The Badari resembles the other early predynastic series more 
closely than it does the late predynastic series. With advancing time the type 
in the predynastic era was changing by becoming less prognathous and by ac- 
quiring slightly lower nasal and higher cephalic indices. A slow progression in 
the same direction appears to have continued throughout the whole of the dynastic 
era and down to Boman times in Upper and Middle Egypt at least. The later of 
the known Eg 3 rptian types for this period differ quite markedly from the Badari 

* Loe, cit, p. 124. 

t This seems to be the safest oonolasion to draw In spite of the fact that an insignificant coefficient 
of racial likeness is found between the Badari male means and those of the Early Predynastic aeries 
from Abydos, El- Amrah and Hou measured by Thomson and Maclver ; only 14 of the 81 measurements 
used whenever possible in computing the coefficients are available for the latter. 
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type in being less prognathous, and in having lower nasal but higher cephalic 
indices. 

(ii) Close connections can be found between these later Egyptian types and 
some representing European populations in contemporary or later times, but the 
Badari and other Predynastic Egjrptian types bear no close resemblance to any 
recorded for a European population other than that of modern Sardinians. The 
connection between the Badari and Sardinian series is not veiy close, but it is 
closer than that found between the Predynastic and some late D3ma8tic Egyptian 
series, 

(iii) Connections of the same order are found between the Badari and all the 
modem Indian series available, the most intimate being with a Dravidian series. 
The other Predynastic Egyptian types are rather further removed from the Indian, 
and between the last and the Dynastic Egyptians no close resemblances are found. 

(iv) A wider gap separates all the Egyptian from African negro types, but the 
resemblances between these and the Badari and other Predynastic types are appre- 
ciably closer than those between the negro and late Dynastic Egyptian types. 

The interpretation of the observed situation in terms of racial descent must 
necessarily be hazardous in the present state of our knowledge, and far more 
material than that available at present may be needed in order to reveal the 
actual relationships, which are almost certainly complex, between the populations 
represented. Meanwhile it is of interest to note that the Badari racial type, which 
is believed to be the earliest of which we have any adequate knowledge of a 
statistical kind, is also peculiarly generalised. 
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THE RATIO OF THE MEAN DEVIATION TO THE STANDARD 
DEVIATION AS A TEST OF NORMALITY. 


By R. C. GEARY, M.Sc. 


The problom to be considered in this paper is the following. A sample of n' 
elements with measures ocxy ... iPn'> Is drawn at random from a universe which 
is presumed continuous : to determine from the sample whether the universe may 
be regarded as normal, that is to say whether the probability of drawing therefrom 

an element, measuring from ^ to a? 4* when hw is small, is 


1 

cr 




( 1 ). 


Two cases are considered : (i) that in which the universal mean a may be presumed 
known, but the standard deviation a is unknown, and (ii) that in which both the 
mean and standard deviation are unknown. 


The usual method of dealing with this problem, and that which in principle is 
adopted here, is that of devising a function, or functions, of the sample variables 
a?!, ora, ... which assume characteristic values for infinite samples (i.e. n'«* oo ) 
drawn from a normal universe. We try to find the actual or approximate frequency 
distributions, for different sizes of sample, on the assumption that the unknown 
parent universe is normal. If, for the actual sample, the values of such functions 
are very improbable, we decide that the universe from which the sample was drawn 
is not normal. 

It is quite obvious that in problems of this kind we can only determine 
“necessary** (and not “sufficient**) conditions. We may hope to be in a position 
to say that the parent universe was not A; we can never say oh the evidence of 
the limited sample alone that it probably wds drawn from a universe B, In the 
first place, the number of independent tests which may be applied is strictly 
limited: it cannot exceed n\ the number in the sample, when the parent universe 
is completely known. In fact, if n' algebraically independent functions of the n' 
variables be constructed, all other functions of the variables will be algebraically 
related to these n' functions, and their frequency distributions will in consequence 
be absolutely determined by the multiple frequency distribution of the w' functions. 
If all the n* tests showed that the given sample could be regarded as not im- 
probably having been drawn from a universe B, it could equally plausibly be 
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regarded as having been drawn from an infinity of other universes differing from B 
(in functional form, or in the values of the parameters) to a greater or lesser extent 
according as n' is small or large. 

The range of choice of possible tests of normality is extensive. Clearly it is 
desirable that the test functions should satisfy the following conditions : 

(i) They should be as simple as possible, not only for convenience of calculation 
but because the simpler they are the more likely is it that their frequency distri- 
butions, or at least their moments and Thiele semi-invariants, can be determined. 

(ii) They should be symmetrical in the sampled variables so as to avoid bias, 

(iii) When n' tends towards infinity they should assume values characteristic 
of the normal universe. 

(iv) They should be independent of the unknown parameters a and <r [see (1) 
above], in the sense of being invariant for the transformation 

2 , 

Hitherto* the fuactions V/S, and l3», with 

/3i = tn,*fmt\ /8, = mtlnit* 

n* W n* n 

and n'ma = 2 (a?- — fl?)® w'ms®* 2 — ^)®, n'm*— S — S)*, 2 

t'scl tal jal 

have been used for testing normality. They obviously satisfy conditions (ii) and (iv) 
above. With regard to (i), they are possibly the simplest functions of the sample 
moments which satisfy the other conditions. Their numerical computation is not 
difficult. With regard to condition (iii), and /3% assume the values zero and 3 
respectively when n' tends towards infinity. 

From the work of C. O. Craig and R. A. Fisher the lower moments of and 

for normal samples may be regarded as completely determined for all values of n\ 
R. A. Fisherf has gi^n the exact values of the second, fourth and sixth Thiele 
semi-invariants of VySi — the odd semi-invariants are of course zero, since is 
symmetrically distributed for normal samplea These values show that for quite 
small values of n' the fourth and sixth semi-invariants approach closely to their 
normal values zero. V/3i cannot, however, be regarded as an efficient test of 
normality. Infinite samples drawn from all symmetrical universes will give a 
zero value of /3i, Accordingly it is only a test of symmetry and cannot be expected 
to distinguish the normal from other symmetrical universes. 

I have used the formulae for the Bx and B% of /3^ for normal samples, which 
K S. Pearson]: has derived from R. A. Fisher’s results, in the computation of the 
following table. 

* [The P, X* and P, tests have been applied, and are available. En.] 

t ** The Moments of the Distribution for Normal Samples of Measures of Departure from Normality. 
Proe. Bayal Society of London^ Series A, Vol. ozxz. p. 22. 

X **NoteB on Tests for Normality." Biometrikat Vol. xxii. Parts 111 and lY. 
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TABLE A. 

Values of s/Bx and of {Normal Samples), 


Size of 
<iample n' 



0 

0 - 7 ] 

2*91 

11 

1*48 

6*01 

26 

1*75 

8*93 

36 

1*69 

8*89 

61 

1*57 

[ 8*38 

101 

J 

1*27 

L,. 

6*75 


Even in samples of 101 the value of Bx is 1*27 and the value of B 2 is 6*76, very 
far removed from the normal values 0 and 3 respectively. Furthermore it is 
evident that the approach to normality of the distribution of improves 

very slowly with increasing size of sample. As the prospect of obtaining the exact 
or even approximate frequency distributions for for all values of n' seems 
remote, and as the distribution of S 2 for large samples cannot be presumed normal, 
it appears desirable to try to devise other tests which do not suffer from these 
disadvantages. 

The value of the mean deviation d for infinite normal samples is 


1 r+00 

d — — 7 = l^ — ale dx 

crV27rJ-oo' 



O’, 


so that 


- = V- == -7978845608... . 


The test which will now be examined is that of estimating dja from the 
and attempting to determine the probability, or improbability, that the 

universe was normal from the difference between the estimate and • 


sample 

parent 


Moments of 

Suppose that, in the first place, the mean of the parent normal universe is zero. 
Later on the reduction of the most general case of the universal mean unknown 
to this case will be discussed. A random sample of n is drawn with measures 
yi, yai •••! yn, and the estimate of dja is taken to be 

( 2 ). 


with ns^ fm'S.yiK 

It will be evident at once that 

Furthermore the frequency distribution of Wn will obviously be independent of cr, 
the unknown universal standard deviation, which in consequence may be presumed 
unity, without loss of generality. 
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The probability of obtaining the given sample is then 

Change the variables into generalised polar coordinates (p\ <f>o, <t>i, 
and for brevity write Si for sin Ci for cos^^. Then 

yx “ «n-S • • • 
y% *= p8n-% «n~8 ••• SiCq 

yz^ P ^n— • ^n— B • • • 0l 


yn— 1 ■■ P^n~-% Cn— S 
yn ” P Cn— 2 

SO that 2y<® = p* = 


<^n-a). 


(3), 


and the integral element becomes 

pS 

^ ® ^<#>0 . . . »"r5 3^n~a (4), 

from which it follows that p(='^ns) and the n— 1 polar angles are mutually 
independent. After the transformation, Wn, given by (2), is a function of the polar 
angles only and therefore independent of 8 in normal samples. Indicating as before 
universal mean values by square brackets, it follows that 


[\y\P] = [Wn^8P2 = M, 


where p is any whole number, so that the moments (fixed origin) of Wn are known 
when the corresponding moments of |y| and s are known. Now, from (4), the 
frequency distribution of 8 is 

_ 

Ge ^ (0 < a < 00 ), 

so that the universal moments of a are as follows: 


«i' = w 

*.' = [«*] = ! '1 


o ^ ^ 2 



= [«r, = 0^ + 1) (” + 3) (« +.5) 

^ [Jl"] 

etc. 

> 




f e ^ a” da, 

Jo 
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e * da, 


or (for n even) 

But by Stirling's Theorem 


, n\ /tt 


whence 


1 1 _ 1 

n I •» r?**'*’* V 2 iSr ”^ 12 n 360 n*’^ 1260 n® ICBOn'^ 

. 1 1 , 17 

4n“^24nS 20n»‘^112n7 • 


The universal moments of |y| = ~ 2 |y<| may be determined very easily since 


jo ** ‘^y 


f (jp — 1) — 3) ... 3T when p is even, 

(P - 1) (P - 3) ... 4-2 when p is odd, 

and since the variables are mutually independent. The first eight moments, m*' 
of |y I are as follows : 


»«i' = ^[S|y,l]-^2[|y,|] = ^|-.o 


W “ ^ {«! + 

m9 = ^ {2ni + SriM -h a*7ia} 

m/ = ^ {3wi -f (8a* H- 3) 4- 6a* ns + a^ws} 

== {8ni H- 36wa -f* (20a* + 16) na + 10a* + a* ns] 

n 

»»•' = ^ f 16«i + (88a* + 45) n* + (165a* + 15) ns + (40a* + 45a*) n. 
+ ISa^Ws + a®n«} 

^ [48mi + 483»» + 7 (64a* + 76) w, + 106 (5a* + 1) »• 

+ 36 (2a® + 3a*) ns + 21a® ne+ a^n^] 

TO*' = T {105ni + (1280a* + 786) n, + (4662a* + 630) w, 
n® 

+ (1668a® + 2940a* + 106) + (1330a® + 420a*) ns 

+ (112a® + 210a®) + 28a® n 7 + a®n8} 

where *« n, na *= » (^ — 1 )» na *“ n (n — 1 ) (n — 2 ), etc. 
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The moments, of Wn may be written down at once from ( 5 ), ( 6 ) and (7) sinoe 

^ mu 

Finally the Thiele semi-invariants Xu of can be found from the recurrence 
formulae 

fii* * X.1, 

fiz == 4* Xt, 

~ XifAu + + Xs» 

= X 1 /X 3 ' -f SXf/LAa' + 3Xt/i/ 4- X 4 , 
etc* 

On account of the intervention of the exponential in the formula for 8%*, it is 
not possible to give simple rational expressions for the X* such as R. A. Fisher 
h^ given for the semi-invariants of his functions ktjkji and k^k^ (very similar to 
and >3a). The calculation of the semi-invariants of for any particular values 
of n, step by step as indicated above, is a lengthy process but presents no parti- 
cular difficulty. As the value of Xs (equal to the square of the standard deviation) 
is quite small even for low values of n (see Table B below), and as \u appears to 
be of the same order of magnitude as Xf*^, it is necessary to work to many places 
of decimals in order to obtain sufficiently accurate values for the higher semi- 
invariants. The values of the first four semi-invariants and of and y 8 i for 
certain sizes of samples are given in the following table. 

TABLE B. 


Semi-invariants, etc,, of w^. 


Size of 
sample n 
( = n^-l) 

Xi 

X 2 


X 4 

Vi.»Xs/x,» 

— 

Sj = X4A8*+3 

5 

‘838626 

•00617082 

- *00027074 

•000006184 

-0-66 

3*16 

10 

•818049 

•00375296 

~ *00010696 

•000003667 

-0*47 

3-26 

26 

•805901 

•00167809 

- *00002260 

•000000461 

- 0-33 

3-16 

60 

•801884 

•00086990 

-•00000616 

•000000072 

-0*24 

3*10 

100 

*799882 

•000442786 

- -00000161 

•000000010 

-017 

3-06 


Even for small samples, the values of VBi and B% are quite near the normal 
values. The values of X^/Xt**^ for p «= 3, 4, 6 and 6 for samples of 10 are as 
follows, correct to two decimal places : 

Xb/X,* = - 0*47, X 4 /X,» - 4- 0*26, 

X«/X,^ « - 010, X«/Xb* = - 0 29. 

From this investigation it appears very likely that, for quite small samples 
drawn at random from a normal universe with mean zero, the distribution of is 
&irly close to normal. 
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The General Case. 

In the foregoing analysis it has been assumed that the universal mean of the 
parent normal universe is known to be zero. This case has some practical importance. 
It occurs when the sample variables are themselves differences of pairs of variables 
drawn from the same universe. In the general case if the n' random and inde- 
pendent observations were /ri, ... we might take « a?! — etc. 

(if n' were even), so that if the x*f^ are normally distributed so are the y*s, but with 
universal mean zero> and the Wn of the might be used to test normality. It is 
evident, however, that this procedure would not sufficiently utilise the information 
to be derived from the sample of ic’s. 


The simplest transformation is of course 

(^= 1, 2, ... w'), 


and in fact it is implicit in the tests and y92* It has the great advantage of 
being symmetrical, but cannot be used in the calculation of because the i/'s, 
while normally distributed with mean zero, are not mutually independent. 

The properties of au interesting transformation will now be examined*. Suppose 
that the n' a?*s are arranged in random order, for example in the order in which 
they were drawn. Let 


/a-i + /Tg \ /2 xx ~ 

/a?i + X 2 4 Xz \ /8 Xx 4- x^ — 2/r8 

y* ~\ 3 - j v 2 “ V6 " 


3 

/Xi-\'X2^ ... +Xn' \ / n* a7i4-^2 4- ... 4-£r;,t'-i — (^i'* 

- 1 ■ j V _ 1 - “ - 6?'-^) 


• 1) x,^ 


Xx’k’X2’\- ... 4- Xn’ 

y^' ~ v»' 

The reciprocal is 


^ 4 . ^ 4 - 4 . - 4 - 2 /”' 

V2 VG ^/n' 

rr =_i^4.1i4. 4. yn-l , yn' 

* \/G Vw'(a' — 1) 


Xn>-^1 = 


1^' 

fn' 


/ n'-2 y n'-.i 

y”'-* V n' _ 1 + 


.( 8 ). 


.( 8 '). 


* This transformation has been given by Williafn Burnside in his Theory of Proltability (Cambridge 
University Press, 1928, p. 97). Dr Karl Pearson kindly informs me that it is due to Helmert (1876). 
Its geometrioal properties indicate its fundamental importance. For example, when n' = 4, the four 
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The structure of the transformation is evident: yx represents the diflFerenoe 
(to a numerical factor) between the mean of the first two observations and the 
second, y% the difference (to a numerical fiictor) between the mean of the first 
three observations and the third, etc. The transformation is completed by setting 
y^t equal to Quite apart from its mathematical implications, which will 

presently be examined, the method of analysis of the primary data is therefore a 
very natural one. 

The new variables yi, ya, ... yn* have the following properties : 

(i) They are normally distributed with the same standard deviation as the a?*8, 

(ii) The universal mean of each of the variables yi, ya, ... yn'-i is zero. 

(iii) Each pair of the first w' — 1 variables are uncorrelated and accordingly, 
since they are normally distributed, they are independent. 

(iv) The transformation is orthogonal, i.e. 2^?^® = Sy^*, or 

2 yi® = 2 a?i® — m' ^ = 7w® 

ii=i i=,i 

with n = n' — 1. 

The n variables yi satisfy all the conditions specified in the previous section 
and accordingly we take 

as the test of normality. 

The advantages of regarded as a function of the original variables a?, are 
os follows : like ySa it assumes a characteristic value for infinite normal random 
samples ; the values of its semi-invariants indicate that its distribution is far 
closer to the normal, even for moderate samples, than that of yQj ; and as will 
appear in the following section its frequency distribution can be determined for all 
normal samples. Its principal disadvantage is that it is not symmetrical in the 
original variables, which means that in general it will assume different values for 
the n*\ permutations of the Xi, Consequently there is a risk not present in the use 
of that an abnormal value of Wn may indicate an unrepresentative permutation 
of the sample and not a non -normal parent universe. Accordingly it would seem 
advisable in all cases to randomise the sample a few times and to calculate Wn for 
each permutation. The mean or median value of the different might then be 
taken as the representative value for the purpose of determining the probability of 
normality. 

Another course meriting practical and algebraical investigation is that of taking 

Wn = 2J iCi — ® ly/^ Vw' (n' — 1) « 2 I /Ti — ^ ly V2 («?c — ^)® 




planes :Cj = 0, x^=s- 0 , — in the j/*spaoe for ^4 = constant are the sides of the regular tetrahedron 

with centre of gravity at the origin, one side parallel to the plane ^3=0, one vertex on the 2^3>axis and 
one edge parallel to the pj-axis. This transformation is probably the most symmetrical orthogonal 


transformation of variables, one of which is = 


^/n' 


2 Xj , 


which could be devised. 
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instead of Wn. It may be shown that 

[W] == Lwnl 

and, of course : Lim Wn = 

n-^oo 

and Wn, a symmetrical function of the coi, has a unique value for eflich sample. 
It is evident that for moderately sized samples the value of will not differ 
much from the values of Wn for the different permutations. From work at present 
in progress it appears that in moderately sized samples the distribution of Wn is 
very similar to that of Wn- For n'**51 the values of the first three moments and 
of Bi for these test functions are compared in the following table : 



Mean Xj 

Standard 

Deviation 

Third 
Moment X 3 

•J 


•8019 

•8019 

•0292 

•0295 

- -00000585 
-•00000610 

-0-24 

-0-24 


No significance attaches to the equality of the means : they have been made 
so by choice of constants. To the degree of accuracy indicated, it is very likely 
that for samples of more than 40 the probability points for shown on Table F 
below may be taken as applying to Wn, 

When transformation (8) has been applied to the variables Xi the distribution of 

Un * yjs 

is given by G sin^”® 0 (tt ^ On-z ^ 0) 

with Un = cos 

This is not an efficient test of normality or even for asymmetry in the parent 
universe because the value of y, and hence of will be small for samples drawn 
from any universe. Hn might, however, be used to test the ** representativeness ** 
of the permutation of the n' variables Xi : if the value of is not abnormal, then 
the permutation is probably representative. 

If the parent universe were not normal, the efficiency of the test would be 
leopardised if as a result of transformation the distributions of the transformed 
variables yi were brought appreciably closer to normality. It is fairly evident that 
in general this will not occur. If a be the universal mean of the the value of 
the A;th transformed variable y* will tend towards a — with increasing A?. It may 
be shown that if the pth semi-invariant of the variables is Xp, the pth semi- 
invariant of the transformed variable y* is 

from which it follows that if the ar/s are symmetrically distributed, so are the y^'s. 
The character of the distribution has not been modified fundamentally by the 
transformation. 
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The Frequency Dietribution of Theoretical Aspects. 

The probability of obtaining a given value of 

is equal to 2“ times the probability of obtaining that value with positive values of 
the variables The frequency distribution of can therefore be found by 
confining attention to the region 

(»-« 1, 2, ... n) 

which on transformation into generalised polar coordinates corresponds to the 
region 

(i = 0. 1, 2, ... »-2). 

Vnw„ = <v-i + *n-i(c»_* + «„_s(c»_4+ ••• 

= Cn-% + Vn — 1 

where, as before, c,- and 8i represent cos and sin respectively. 

For a given value of since Wn-i < If 

Wn < + Vn- 1 «n-2, 

or, on dividing across by *Jn and setting 

cos a — , cos /3 =^Wn, 

we find cos (^n-a — «) > cos l3, 

whence the limiting values of arc given by 

a + /3> ^n -2 > a 


It may be shown that a 4- so that the condition a + /3^ may be ignored. 


The condition Wn~i ^ , on a similar analysis, yields the following limiting 

values of <f>n-%, 


with cos 


7--\/ 


>^ + y; <f>n-t < j — 7» 


These conditions only apply when 


or in practice for very small samples. 

Since the polar angles <f>i are all independent for normal samples, the joint 
probability of and 4>n^t is given by 
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where fn^i (Wn-i) represents the frequency distribution of Wn-i and where the 
constant is determined by 

^n— 2 f ^n-2dc^«_2 = 1. 

Ji) 

Now, when <f>n -2 is fixed, 

V n Swn *= Vn -* 1 8n-2 ^Wn-1 > 

and, on substituting for Swn^i, the integral element becomes 

2 ®n-2 l) 

The frequency distribution of Wn is accordingly found from that of Wn-x by the 
integral iteration formula 

/ u feOH(a~-fi) 

f, (w^) == sf — J «;il 2 /n-l (Wn-l) 


.(9), 


with 






\/n—l 8n~2 

When Wn < the value of the function fn-i (^n-i) is zero between the limits 

cos ( J + 7 ) and cos (5 - 7 ) • 


A limiting value of Wn-i 
by setting 


regarded as a function of ^n ~2 (with Wn 


dwn~i _ ^ 
d<l>n-2 ’ 


fixed) is found 


which gives 


_ 1 

^n—2 ~ 

NnWn 


This is found to yield a minimum value of ; whence, on substitution and 
reduction 


(n — 1) — 1- 


If Wr^'^pfriy where p is an integer less than n, this condition gives 

v^n-x > (P — 3 )/(w - !)• 

It will presently be shown that for = 3 the frequency distribution of is repre- 
sented by different functions in the intervals 

1 > > Vi 

and Vi > > Vi 

respectively. From this property and firom the foregoing iteration formula it may 
be deduced that the frequency distribution of Wn has a different functional form in 
each of the intervals 


P 
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Form otfn(wn) near Wn == l/\/n. 

The function Wn assumes its minimum value 1/^n when n — 1 of the variables 
yi are zero, and one is not zero. Suppose that yi is the non -zero variable and that 
the other n—1 variables are so small that their squares are negligible. Restricting 
the variables to their positive values, the joint probability of yi, yn may be 

written 

(\/§) — 

as it may be assumed without loss of generality that the standard deviation of the 
variables is unity. Now 

Wn = 

whence yn = t'»yi — ^ 2 — ••• — yn-i with »„■« V'nWn — 1. 

Change the variable y^ in the integral element so that 


and integrate for positive values of the variables. The element of probability for 
Vn in the neighbourhood of Vn 0 ^i.e. Wn == is found to be 
/ /2\"c. 7 7 -Vn-s , p>nVi-y»---yn^% , 

2«-ian-l)i , 

“tt”/* (n,-2)! 

The last expression represents the probability of Vn near Vn = 0 when a 
particular variable is presumed not zero. Since any of the n variables may be 

presumed not zero to give the value of Wn the probability of Vn is n times the 
foregoing expression. Changing to the original variable Wn the probability of Wn 
in the neighbourhood of Wn = is finally 


w + l 

ttW* (a -2)! 


/ 1 


Form of fn (wn) near Wn « 1. 

Another integral iteration formula for will now be derived which, while 

more complicated in manner of derivation than (9), seems more amenable to alge- 
braical treatment. As before, let y^ (» = 1, 2, ... n) be the normal variables with 
mean zero and standard deviation unity. Our attention may be confined to the 
positive region of the y^ space. In succession transform the variables ortho- 
gonally using transformation (8) and the variables into generalised polar 
coordinates r; ^o, ... so that 

^ cos 
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Then since the yi ^ 0 and rw* =* = 2-8^** ** r*, 

Z1 

Wn * cos ^n- 2 . 

and the integral element of the probability of positive values of the namely, 

«y». 

becomes (\/|)" ® ~ ^ ■ ■ • *»-» *»=» ^^n- 2 . 

The segregation of the factor 

immediately expressible in terms of has thus been effected, and the probability 
of ^n -2 may be formally set down as 

( 10 ) 

(c ^„_2 indicates cot^„_a)» function being given by 

-Fn-a (cfn-2) = f d0o SidOi . . . 8^2^ 

Jd 


iD 

r® 


-3 *^^n-S 


•(11). 


with (\/3”jr ® ** “ 5*'» (i™- 1)! 

and the domain of integration Z), corresponding to 
is given by 


(i = l, 2, ... n\ 


*0 . , Cl "N 

I + -S-U.. 

.'i 


V2 ^6/ \/12/ 

' v2oy 

Vn (n — 1)/ 

«0 Co \ Cl N 

V6/ ^12/1 

1 + M + .. 

. ^ 


' iJiO) 

Vn (n — 1)/ 






«»^-2 d - 


^ii-2 

Vn 


> 0 , 


from (8'). Dividing across by the positive «^^a» it will be observed that all the 
terms except the last of each of the inequalities are free of the angle 6^-% (and 

hence of w^) and the last term becomes This accounts for the form of (10). 

Furthermore it may easily be shown that for all values of the angles ^o, ... ^n-a 

the minimum value of the left-hand terms of the inequalities, for fixed, is 


/ ^ ^ 

V n ^ ^ 



B. O. Gbaby 


828 


so that if satisfies the inequality 

or 

▼ 71 

the domain of integration i> becomes 

277^00^ 0, 

rr^e^^O (t«l, 2, ... (w — 3)), 

and the function (ctn-%^ b ecome s a constant. It follows that the probability of 
s» cos 6^^ when 1 > > vy/ ^ — is givers by 

the constant beiiig given by 




sin»^* Ode « 2^ 


Adverting to the general case, it will be observed that if, after dividing all the 
inequalities except the last by the positive quantity a *iew angle ^'n-s 

given by 

- ( 12 ) 

Vn — 1 v^n(n — 1) 

is introduced, the form of the resulting n — 1 inequalities is identical with that 
of the original n inequalities and hence it follows that the function (cifn~ 2 )» 
given by (11), may be written as 

■^n-2 jcr f -^n-a ^-3 (1 3)* 

c^'n -3 being given by (12). This is the iteration formula required. It will now be 
util ised to determi ne the form of (c^n-a)* hence the frequency of when 

^ prove in fact that the derivative 


fi-4 

2 


(Ctn^,) « ^n-2 (l ~ Z'^l) ^ > 

Ifn^ being a constant. This will be proved by induction. Let it be assumed that 
the result holds for w— 1, i.e. that for 

/n — 2, /n — S 

V V n-l’ 

(l - * . 

Now it may be proved that if c«_. > a/ ” then it follows that 

V n 

^ »i-8 > \/ — 2 — * 


21—2 
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which is equivalent to v/ “ T* Also the function Fn^(ct\^) is equal to a 

/n — * 2 

constant -4n_3 when ^ when ct'n^> Jn — 2, The last inequality 

yields the following limits for c„_ 3 , 

Cn-8 < cos (a + /9), 
c„^ > COB (a — /3), 

with cos a sss — 1 /(n — 1 ) and cos ^ == ?i — 1 . 

Hence, from (13), 

1C t ( tjn'^ rco8(a--/i) 

^ -^n— 2 2 ) ~ -^n— 8 / 4* I w— 3 n— 3 ) — 3 } ^h-3 

J -1 J C08(a+/i) 

Differentiating across with respect to c^n --2 noting that the integral element 
of the second integral vanishes at the limits of integration, we find 


p, y . V __ -dfi-a / 1 X %r f n-3 


On substitution for in the latter integral, upon reduction it is found to be 

of the form (to a constant factor) 


ra:, •tt — 5 

I l(iv — a^o) (o! — a^i)} 2 do!, 

J(Co 


which (to a constant factor) is eqiial to 
In this case 

^1 — ^0 == cos (a — /S) — cos (a-i-/8) — 2 sin a sin /3 




It follows that (to a constant factor) -P^n -2 (c^n- 2 ) equal to 

All the constants have been determined, and the final result is as follows: the 
probability of uf^ = cos 6 when y”T ^>Wn> 

with -^n-i f sin*‘"”®^d^*-2^. 

JO 
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Frequency DlstributionB for n « 2, 3, 4 and 5 . 

The foregoing general propositions completely determine the probabilities of 
for w « 2 and n » 3 as follows : 


w = 3. 


n = 4. 


1 4 

1 > > “To • 


Bwg 


\/2 ’ TT Vi — 
: 4tBw9, 


WjT 


VI > Ws > V4 : 4 (l - ^ cos-*-^— ? - ) Swg. 

\ ir V2 - 2w.*/ 


The probabilities of W 4 when W4 > are as follows : 


1^W4^V| : — sin^3f£;4, 

TT 

Vf ^ ^^4 > V} : ^ cos 0 — sin 3«/4, 


with W4 = sin The form of the probability function has also been determined 
for the third interval, namely VJ Vi. but perhaps the details need not be 
given. The first derivative of J^a(c^ 2 ) was found to be 


IT./ . X 64 /. 3 , / 2c«a® \ 

F, (c<,) = (1 - - cos ^ 3 — . 


(Compare with the frequency distribution of Ws in the second interval.) It was 
found possible to integrate this expression so as to give the following expression 
for the probability of W4 in the interval 


32 / 2 /2c 

Vi > '*^4 ^ Vi ^ ~ cos 0 — sin d — 2 V3 cos 0 cos”-^ g— 


cot*^ 


cot^d 


with cos 0 ^W4, 
n = 5. 


+ 6 cos-» V g 


The probabilities of ws in the first two intervals are as follows : 

: 24(1 — 3^5, 

Vi > ^5 > Vi I ^ ^ 27 r — 10 co8“^ 4. 5 cot 0 ^ 1 — ) sin* 0hw4^ 

with cos ^ a* W5. 

It seems not unlikely that further investigation would have given more 
symmetrical expression to the iteration and other formulae in this section, and 
that exact values for the frequencies for some values of 71 . > 4 and some intervals 
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of might have been obtained. The complicated expressions obtained 

even for n = 4 and n * 6 suggest however that such formulae would be exceedingly 
cumbersome. The present paper has the practical objective of deriving the actual 
numerical probabilities of for samples of different sizes, and it will be shown 
that the results so far obtained are adequate for this purpose 


Computation of Frequency Tables for Wn> 


In Table C will be found the ordinates, and in Table D the cumulative proba- 
bilities of Wn for n =2 to ri =10 inclusive for certain values of These values 
were more or less arbitrarily selected, the pivotal point =* *80 as very close 


wi 


to the mean a / " for infinite samples, and the interval of ’03 as about one-half 

of the standard deviation for w = 10 and therefore less than half the standard 
deviation for n < 10. The values of the ordinates for ?? = 2, 3 and 4 and for 


Wn> 




n 


are the actual values calculated from the formulae given in the last 

section. The remaining ordinates were computed by approximate integration from 
the integral iteration formula (9). For instance, in the calculation of the ordinates 
for the different values of the integral element on the right-hand side was 
regfirded as a function of cos (i.e. C 3 ) and its values found at intervals of *05 
starting with cos ^8 = 0 . The values of ^ 4 (^ 4 ) were found by osculatory inter- 
polation* (used in life table construction) from the previously ascertained values 
yi (’ 86 ), /i (*83), etc.; the values of for were the actual values 

which were easily calculated. 


Finally the values of the ordinate for each selected value of (i.e. *47, *50, etc.) 
were found by approximate integration by dividing the values of the integral 
element into contiguous groups of 3, 4 or 5 and applying the familiar formulae to 
each group. Special methods were used near the upper limits of cos </»3 and near 
the limits of the regions where /i (^^^ 4 ) is zero, as at these limits the values of cos <f>z 
were not multiples of *05. As a check on the work the value of the integral 


was computed by approximate integration and found to equal *0*999, which was 
sufficiently close to unity. 

Without adjusting the ordinates to allow for this slight discrepancy the calcula- 
tion of the ordinates for the frequency of w% was proceeded with in an analogous 
manner, and so on to n = 1 0 . At each stage the proximity of the total probability 
to unity was checked : the actual totals ranged from 0*995 to 1*001 (without any 
adjustment). The work, which involves about 1000 interpolations, was arduous, 
but systematic. 


* See Supplement to the Seventy -fifth Annual Report of the RegUtrar-Qeneral of Births ^ Deaths and 
Marriages in England and Wales ^ Part I — Life Tables, p. 50. 
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TABLE C. 

Ordinates of the Frequency Distribution of w„. 


\n 

3 

2 

4 

a 

6 

4 

6 

6 

7 

6 

8 

7 

9 

8 

10 

9 

11 

10 

1*00 

oo 

4 

0 

0 

0 

0 

0 

0 

0 

•98 

6-3983 

4 

2-0270 

•9604 

•4275 

•1882 

•0814 

•0348 

*0147 

•96 

4-0776 

4 

3-1806 

2*3400 

1*6631 

1-1408 

•7740 

•5190 

•3460 

•92 

3-2487 

4 

3*9920 

3*6864 

3-2698 

2-8246 

2*3794 

1-9780 

1*6315 

•89 

2*7924 

4 

4-6444 

4-9340 

4-7941 

4-6066 

4*3849 

4*1346 

3-88 

•86 

2-4951 

4 

4-9178 

6-1070 

5*4093 

5*6479 

5-79 

5*86 

5*92 

•83 

2-2628 

4 

4 -0809 

4-6789 

6-2092 

6-63 

5-92 

6*29 

6-62 

•80 

2-1221 

2-7019 

3-2978 

3-9074 

4-34 

4*74 

5-13 

6*47 

5*82 

•77 - 

1-9956 

1-9062 

2*6674 

2*91 

3-21 

3*62 

3*75 

3-98 

4-20 

•74 

1 -8930 

1-4049 

1 -8526 

1-99 

2-17 

2*30 

2-40 

2-60 

2*67 

•71 

1*8081 

1-0316 

1-1779 

1-28 

1*32 

1-36 

1*38 

1-37 

1-37 

•68 



•7320 

•7135 

•777 

•767 

•740 

•707 

•678 

•643 

•66 



•4810 

•4362 

•403 

•396 

•370 

•336 

•302 

•270 

•62 



•2647 

•2617 

•224 

•201 

*168 

•144 

•123 

•102 

•59 


•0746 

•1296 

•112 

•0930 

•0728 

•0668 

•0446 

•0349 

•56 



•0532 

•0494 

•0370 

•0294 

•0216 

•0154 

•0104 

•53 





•0124 

•0181 

*0122 

•0091 

•0069 

•0036 

•0024 

•50 





0 

*0040 

*0035 

•0023 

.0016 

•0007 

•0004 

•47 


— 



•0004 

•0007 

•0006 

•0003 

•0002 

•0001 

•44 

( Lower 1 
limit }- 
of i / v ) j 

— 

— 

— 

— 

•0000 

•0001 

1 -0000 

•0000 

•0000 

(- 7071 ) 

(- 5773 ) 

(• 5000 ) 

(- 4472 ) 

(• 4082 ) 

(• 3780 ) 

(- 3536 ) 

(• 3333 ) 

(• 3162 ) 


In Table D the cumulative probabilitiea shown for n = 2 and n — 3 and for 

Wn > -V/ ^ “ O’l’e the actual values calculated from the formulae in the last section. 

The remainder of the table was computed by approximate integ^tion, the formula 
used being based upon that for osculatory interpolation, i.o., for unit interval, 

^ t«o + 13 (ui + 1 ^ 2 ) — Ms), 


P 


a formula of some interest in itself. The slight discrepancies in the total proba> 
bilities were distributed proportionately over the calculated frequencies. 

As a final check, it was calculated from Table D that the mean value of Wxot 
depending for its accuracy on all the interpolations from n = 6 to n =* 10, was 
0*8177 compared with the actual value of 0’8180 given in Table B. ^ 

From the vcdues of and given in the latter table it might have been 
anticipated that the Second Approximation to the Law of Error, namely 


1 j 


fSw 

o’V'Sir 1 

6 ' 

U e^Jf 


to* 

e 
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TABLE D. 

Cumulative Probability of from 


w ,, \ 

3 

2 

4 

3 

5 

4 

6 

5 

7 

6 

8 

7 

9 

8 

10 

9 

11 

10 

1-00 

0 

0 

0 

0 

0 

0 

0 

0 

0 

•98 

•2551 

•0800 

•0271 

•0095 

•0034 

•0013 

•0005 

•0002 

•0001 

•95 

•4043 

•2000 

•1066 

•0690 

•0335 

• 019.3 

•0112 

•0066 

•0039 

•92 

•5128 

•3200 

•2147 

•1495 

•1071 

•0777 

•0578 

•0420 

•0314 

•89 

•6028 

•4400 

•3445 

•2803 

•2295 

•1900 

•1689 

•1335 

•1133 

•86 

•6819 

•5600 

•4915 

• 4 . 3.30 

•3850 

•3460 

•3141 

•2857 

•2623 

• 8 » 

•7534 

•6800 

•6263 

•5810 

•5464 

•5175 

•4928 

•4712 

•4639 

•80 

• 819.3 

•7856 

•7369 

•7106 

•6909 

•6745 

•0608 

•6500 

•6433 

*77 

•8810 

•8535 

•8246 

•8130 

•8043 

•7988 

•7950 

•7926 

•7908 

•74 

• 9 . 39.3 

•9027 

•8907 

•8860 

•8846 

•8868 

•8872 

•8894 

*8920 

•71 

•9948 

•9390 

•9361 

•9346 

•9364 

•9399 

•9431 

•9466 

•9500 

*68 

— 

•9653 

•9639 

•9651 

•9672 

•9707 

•9736 

•9765 

•9791 

•65 

— 

•9834 

•9807 1 

•9824 

•9842 I 

•9868 

•9887 

•9905 

•9921 

•62 


•9945 

•9908 

*9915 

•9929 

•9945 

•9956 

•9966 

•9973 

•59 

— 

•9993 j 

•9964 j 

•9964 

•9971 

•9979 

•9984 

•9989 

•9992 

•56 




•9990 

•9987 

•9990 

•9993 

•9995 

•9997 

•9998 

• 5.3 

— 

— 

•9999 

•9997 

•9997 

•9998 

•9999 

•9999 

1 * 0 ( X )0 

•50 

— 

— 

l-OOOO 

•9999 

•9999 

1-0000 

1-0000 

1 - OOCK ) 

1-0000 

•47 

(Lower 1 

— 

— 

— 

1-0000 

1-0000 

1-0000 

1-0000 

1-0000 

1-0000 

Limit / 

of J 

(• 7071 ) 

(• 577 . 3 ) 

(• 5000 ) 

e 4472 ) 

(• 4082 ) 

(-. 3780 ) 

(• 3636 ) 

(• 333 . 3 ) 

(• 3162 ) 


{a here indicating the standard deviation of w), would furnish a close approxi- 
mation to the actual distribution. It is extremely fortunate and somewhat un- 
expected that, as shown in the diagram, the correspondence is very satisfactory 
for samples of only 10. On this diagram the normal curve, with the actual mean 
and standard deviation of Wm, is shown for the purposes of contrast and com- 
parison. (circles (©) only mark the curve for the Second Approximation, as the 
ordinates of the latter curve are too close to those of the actual distribution to 
permit of accurate drawing. 

The correspondence between the Second Approximation and the actual dis- 
tributioi'. is brought out concisely in Table E, p. 330, which shows the 1 % and 
5 % probability points for the respective distributions. [By definition, the upper 
1 % probability point is the value of Wxo above which lies 1 % of the total frequency.] 

There is scarcely any significant difference between the probability points as 
calculated by these two systems. While Table B shows that the value of does 
not decrease continuously with increasing size of sample, it does so after a certain 
value of n and never differs much from the normal value of 3. There can be no 
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TABLE E. 



1 points 

6 "/o points 


Upper 

Lower 

Upper 

Lower 

Actual distribution 

•941 

•666 

•911 

•710 

Second Approximation to Lfiw of Error 

•942 

•656 

•912 

•708 


reasonable doubt that the Second Approximation will yield sufficiently accurate 
1 % and 5 % probability points for samples of 11 (== 74 ') and over. On this assump- 
tion the following table has been calculated : 

TABLE F. 

The 1 % and 5 % >prohahility points of . 


Size of 
Sample 
n' 

n 

Upper 

limit 

of 

Prot 

Upper 

l"/o 

ability 

Upper 

6°/o 

points 0 

Lower 

5°/o 

f 

liower 

i7o 

Lower 

limit 

ofw,, 

Mean 
of w„ 

Standard 

Deviation 

of 

6 

5 

1 

•980 

•954 

*696 

•626 

•4472 

•8385 

•0786 

11 

10 

1 

•941 

•911 

•710 

•656 

•3162 

•8180 

•0613 

16 

15 

1 

•916 

•891 

•720 

•677 

•2581 

•8113 

•0516 

21 

20 

1 

•902 

•879 

•728 

•691 

•2236 

•8079 

•0464 

26 

25 

1 

•892 

•870 

•734 

•701 

•2000 

•8059 

•0410 

.31 

30 

1 

•884 

•864 

•739 

•709 

•1826 

•8046 

*0376 

.36 

35 

1 

•878 

•859 

•743 

•716 

•1690 

•80.36 

•0350 

41 

40 

1 

•873 

•855 

•746 

•720 

•1581 

•8029 

•0328 

46 

45 

1 

•869 

•851 

•749 

•725 

•1491 

•8023 

•0310 

51 

50 

1 

•865 

•849 

•751 

•728 

•1414 

•8019 

•0295 

76 

75 

1 

•853 

•8.39 

•769 

•741 

•1155 

•8005 

•0242 

101 

100 

1 

•846 

•834 

•764 

•748 

•1000 

•7999 

•0210 

501 

500 

1 

•820 

•814 

•783 

•776 

•0447 

•798.3 

•0096 

1001 

KKK) 

1 

•813 

•809 

•787 

•782 

•0316 

•7981 

•0067 


Conclusion. 

It is a simple matter to devise theoretical universes which random samples 
will identify as probably non-normal by the test and to determine roughly from 
Table F the sizes of the samples required. For example, the value of the ratio of 
mean deviation to standard deviation (in infinite random samples) for the rectangular 
distribution 

iBw (-hl>a?^ — 1) 

/o 

is = *866, so that, with no presumption of knowledge about the nature of the 
distribution, samples of about 75 might be required in order that in the majority 
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of oases the values of found would lie above the upper 1 % probability point. 
The values of the ratios (in infinite samples) for the distributions 

2 

- (1 (+ QO > a > - 00) 

2 1 

which bear superficial resemblances to the normal, are — = *637 and -75 = *707 

TT V " 

respectively, and samples of about 26 and 60 respectively would usually be required 
to indicate their non-normality. It is important to observe, however, that the test 
will often react for far smaller samples. These three universes are symmetrical. 
In cases where asymmetry is suspected a count of + signs and — sigpis of the 
transformed variables yi in moderate sized samples (say of 60 to 100 ) may reveal 
such asymmetry. The number of signs should of course be distributed in the “point 
binomial” ^ 

for normal samples. 

The efficiency of for testing normality with the types of samples actually 
occurring is now a matter for practical investigation. The indications are that 
in practice it will not often react for samples of less than, say, 60. In using it we 
are manifestly liable to what J. Neyman and E. S. Pearson* have called “Errors 
of Type II,” i.e. of accepting an hypothesis Ho (in this case that the universe is 
normal) when some alternative is true. The question arises : if vJn fails to identify 
non-normality for a sample known to be derived from a non-normal universe will 
any other general method prove effective ? In other words, an “existence theorem ” 
for tests of normality seems to be required : a theorem which, while not necessarily 
indicating the form of the test or tests, will at least establish the “maximum 
sensitivity” (with a suitable definition of “sensitivity”) however approximately 
and with whatever simplification in the conditions of the problem. This may be 
quite small for small^samples ; it is manifestly nil for samples of 2. As shown by 
R. A. Fisherf for in normal samples of 3 and in the present paper for W 2 . the 
distributions are wholly or partially U-shaped, and in consequence they reveal no 
region of improbability. On the other hand Wn must identify non-normality in 

indefinitely large samples, unless the universe has the normal value of ^or the 

ratio of mean deviation to standard deviation. The only important non-normal 
distribution which assumes the normal value for the ratio appears to be the Second 
Approximation to the Law of Error which also gives the normal value of 3 for 
in infinite samples. 

Smumeury. 

The values of the moments and of VRi and B% of the distribution of for 
normal samples indicate that impracticably large samples would be required in 
order that the distribution would be presumed normal. As the frequency distribu- 
tion of ySa is unknown and as V/Si is to be regarded as a test of asymmetry rather 


* *<The Testing of Statistioal Hypotheses in Belation to Probabilities a priori.** Proceeding$ of the 
Cambridge Philoeophieal Society^ Vol. xxix. Part 4, p. 498. 
t Op. cit.f p. 20. 
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than of normality, we are constrained to seek new measures of normality. The 
measure considered in this paper is the ratio of the mean deviation to the standard 
deviation calculated from the sample, which ratio in infinite random samples 

equals 


Two cases are considered : (i) that in which the experimental technique is such 
that the universal mean may be presumed known and hence, by a change of origin, 
presumed zero ; (ii) the general case of both universal mean and standard deviation 
unknown. In the first case, if the sample measures are yi, y2> yn> we take 




i -JM. 

»Jn VSyt* 


In the second case, if the sample measures are ... arranged in random 

order, we take 


Wn 




Xx 4- . . . 4- a?,: ~ %Xi^ 


Vi (t -I- 1 j 


^ j \/ 


n = n' ~ 1. 


In this case has not an unique value for each sample and it is suggested 
tentatively (pending further work) that for moderately sized samples the function 

Wn — 2 I ® I / v/ 2 («{ — 2)* 

be used instead of Wn- The frequency distribution of tVn may be regarded as 
completely determined for all normal samples. From Table F may be determined 
the approximate improbability of the value of Wn found from a given sample 
occurring if the sample were drawn at random from a normal universe. 
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A COMPARISON OF p, AND MR GEARY’S i/;, CRITERIA. 

By E. S. PEARSON, D.Sc. 

1. The Choice of alternative Testa, 

In the development of statistical sampling theory it has often happened that 
more than one test of a given hypothesis is available. Generally on theoretical 
grounds it is possible to specify which of these tests is the most efficient; but it 
may happen that owing to mathematical difficulties in putting the ideal test into, 
working form or to practical difficulties arising from the extent of computation 
involved, the statistician will choose to employ a second best but simpler test. In 
the case of testing the hypothesis that a sample has been drawn from a normally 
distributed population, it seems likely that for large samples and when only small 
departures from normality are in question, the most efficient criteria will be based 
on the moment coefficients of the sample, e.g. on the values of and While, 
however, the sampling moments of these quantities are known*, it has not so far 
been possible to determine the actual sampling distributions from which the 
probability integral could be obtained exactly. 

In earlier papers in this Journals I suggested that empirical curves having 
the correct first four moment coefficients might be used as approximations to 
the unknown true distributions, and gave tables of 5 and 1 ®/o levels of 
probability for and /32, which have since been republished in Tables for 

Statisticians and Biometricians, Part II (Table XXXVII bis). The sampling distri- 
bution of being symmetrical and not very leptokurtic presents relatively little 
difficulty, and a recent investigation by K. Williams t suggests that the tables 
previously given cannot be far in error. A test on ^ P\ is however only sensitive to 
lack of symmetry, and the adequacy of the tables for the 5 and 1 limits for 
P 2 is at present more doubtful, since the moment coefficients indicate a very skew 
and leptokurtic distribution which it is difficult to approximate to with confidence 
from a knowledge of four moment coefficients only. 

R. C. Geary in his interesting paper above has therefore suggested an alterna- 
tive criterion, which is based on the ratio of the mean deviation to the standard 

* B. A. Fisher, Proe, Boy. Soc. Series A, Vol. 130 (1930), pp. 16 — 28. 

t Bionutrika, Vol. xxii. (1930), pp. 239 — 249, and pp. 423 — 424. 

X Biometrika, Vol. xxyn. (1933), pp. 269 — 271. Mr Williams has extended my tables for n/^ down to 
n' = 20. 
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deviation. This criterion which, as shown below, is sensitive to changes in the 
population has the advantage that 

( 1 ) it yields a more manageable sampling distribution, for which he has been 
able to provide tables of percentage limits; 

( 2 ) it involves less trouble in calculation, since the mean deviation is more 
readily obtained than the fourth moment. 

In assessing the value of these alternative tests it is necessary to bear in mind 
two risks of error in judgment: 

Type I, that of rejecting the hypothesis tested when it is true, i.e. of deciding 
the population is not normal when it is; 

Type II, that of failing to detect departure from normality when it really exists 
in the population. 

In the case of ^ 02 , no attempt was made in my previous paper to calculate 
probability levels for rK 100 because of the very high values of ^ 1 (^ 82 ) and 
find before further investigation is completed, the limits given for samples 
of less than 200 or 300 must be regarded as only approximate. Mr Geary has how- 
ever been able to obtain precise levels for his ratio and reports that investiga- 
tions at present in progress give reasons for believing that for n* > 40 the more 
generally useful criterion will have the same sampling distribution as His 
test therefore as far as it concerns kurtosis rather than asymmetry does provide a 
reliable control of the error of Type I. Its efficiency in dealing with the other type 
of error can be most readily judged on the basis of some numerical investigations. 
Such investigations have been carried out after consultation with Mr Geary*; the 
work has been rather lengthy as it seemed desirable to compute values for and 
/92 as well as for Wn ^ and in some cases 

2 . The Experimental Sampling, 

The problem to be dealt with is that of comparing the relative efficiency 
of Wn, and >^2 in controlling errors of Type II, i.e. in detecting departure 

from normality in the sampled population. Samples were drawn with the help of 
Tippett’s Random Sampling Numbers f from the seven different populations, which 
are shown in Table I. Further details of the experiment are given in an Appendix 
to this paper. 

The populations II, III, IV, VI and VII have been used in a number of previous 
sampling experiments J ; I and V, the rectangular and double exponential popula- 
tions are referred to by Geary on p. 330 of his paper. Population IV was derived 
from the equation 

y = yo(i (!)• 

* My apologies are due to Mr Qeary for the delay in the publication of his paper which has resulted, 
but I hope that he will regard the work as worth while. The heavy computation has been undertaken 
by students in the Department of Applied Statistics, University College, London, and in particular by 
Miss C. M. Thompson. 

t TtacU for Computer$y No. xv. 

t For an account of these distributions see also J. M. le Boux, Biometrika, Vol. xxm. (1931), 
pp. 158—161. 
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When the distribution law is very leptokurtic there is inevitably a certain 
difference between the moments of the probability distribution represented by the 
Random Number Scale (adjusted to a total of 10,000) used in sampling and the 
moments of the theoretical law. It is the former that have been entered in Table I. 
For exam pie, for the double exponential curve 6*0 and (Mean deviation)/(Standard 

deviation) “Wn** '707; .but for the sampling scale used and Wn=«'700. 

TABLE I. 

Populations sampled. 


No. 

Type 

■jPi 

Pt 

Mean deviation 

Standard deviation 

I 

Roctangiilar 

0-0 

1-8 

866 

II 

Pearson Typo 11 

0*0 

2*6 

•818 

III 

„ VII 

0*0 

4*1 

•768 

IV 

„ « VII 

00 1 

7’1 

•734 

V 

Double Exponential 

0*0 

5-9 

•700 

VI 

Pearson Type III 

0-7 

3-75 

•789 

VIJ 

I 

»» >» * 

1-0 

3*8 

•797 


Again the law of equation (1), p. 334, above has /32 =* 9*0, while for the sampling 
scale /Sa* 7*05; the difference arises owing to the practical difficulty of representing 
adequately for sampling purposes a curve of this form. These points arc refen*ed 
to again in the Appendix. 

The sampling may be divided into three groups. 


Series A. 

Six samples of 60^ were drawn from each of the populations II, III, VI 
and VII. If the 50 values of the variable sc are denoted by a?a» then 

Geary*s criterion w^, may be calculated from the n =* n'—l* 49 values of y obtained 
by the transformation of his equation (8). By random re-arrangements of the 60 
icSt four series of y's were obtained, and hence four different values of Wn* for each 
of the six samples from the four populations. Samples of 60 were also obtained from 
a Normal population and treated in the same manner. Ektch series of 60 values of 
X gave a single value of 




n' 

2 \xi^x\ 

t «»1 

\/ n' 2 (xi — wy 


Mean deviation 
Standard deviation 


( 2 ) 


where x is the mean value of x in the sample. Table II contains all these values 
of Wn and Wn, which have also been plotted in Fig. 1. On this diagram the lower 


n' will be used for the sample size to conform with Geary’s notation. 
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TABLE II. 

Samples q/* 50 ; Values of Wn o/nd 



Normal 

Population 

Population II 

Population III 

Population VI 

Population VII 

Sample 













%0n 

Wn 

w„ 

Wn 

Wn 

Wn 

Wn 

Wn 


(a 

•791 

•802 

•866 

•864 

•760 

•769 

•822 

•822 

•764 

•778 

1 J* 

•820 


•831 


•766 


•813 


•791 



•813 


•860 


•780 


•833 


•782 


U 

•800 


•842 


•761 


•809 


•776 


fa 

•799 

•822 

•859 

•864 

•791 

•774 

•792 

•792 

•832 

•805 

O A 

•780 


•859 


•797 


•815 


•817 


"V' 

•838 


•864 


•762 


•800 


•737 


u 

•834 


•861 


•766 


•817 


•830 


fa 

•774 

•749 

•810 

•814 

•827 

•811 

•830 

•796 

•8.30 

•817 

sJ* 

•764 


•820 


•799 


•798 


•826 



•759 


•788 


•788 


•783 


•807 


[d 

•758 


•816 


•818 


•794 


•833 


fa 

•817 

•826 

•808 

•790 

•786 

•789 

•821 

•814 

•800 

•788 

4 4^ 

•827 


•809 


•809 


•818 


•791 



•833 


•807 


•778 


•784 


•817 


u 

•830 


•830 


•785 


•795 

'* 


•774 


fa 

•824 

•826 

•782 

•816 

•786 

•782 

•789 

•787 

•833 

•812 

4'* 

•812 


•788 


•767 


•808 


•800 


■796 


•824 


•783 


•792 


•812 



•817 


•796 


•787 


•808 


•767 


(a 

•801 

•807 

•835 

•841 

•758 

•726 

•811 

•847 

•828 

•806 


•8(X) 


•836 


•748 


•847 


•823 


•804 


•834 


•720 


•859 


•776 


u 

•827 


•848 


•764 


•868 


•800 


Mean 

•8046 

•806 

•8272 

•830 

•7773 

•776 

•8127 

1 ! 

•810 

•8019 

1 

•801 


N.B. a, b, c and d are four random arrangenionts of the same 60 observations. 

and upper 5 °/^ and 1 probability limits and the mean level have been drawn as 
follows : 

Lower limits: 5 7, '751, 1 7, -728. 

Upper limits: 6 7o 1 7o 
Mean level *802. 
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FiG.l . '\5 ^ubs of IN Samples of 50. 



SCALE OF Ur^ 

C • REPRESENTS UT^ (-4 VALUES FOR EACH SAMPLE). 

C ^ REPRESENTS ( I VALUE FOR EACH SAMPLE). 

The figures are taken from Geary's Table F. The diagram shows clearly that: 
(a) The four values of Wn vary very considerably about Wn» 

(h) Wn cannot be taken as representative of Wn‘, it sometimes lies outside the 
range of the four values. 

(c) When the population is not Normal very few of the sample points fall 
beyond even the 5 limits, so that departures from normality of this order could 
rarely be detected with samples as small as 50. As far as can be judged from this 
small experience the chance of detection appears on the whole to be no greater 
using Wn than Wn- 

The labour involved in the calculation of even a single value of Wn is very 
considerable, and since for n' as large as 60 there is every reason to believe that 
Wn follows closely the same sampling distribution as Wn, it appears that this 
simpler criterion can be used; in the later experiments therefore Wn alone has 
been calculated. 


Biometrika xxtxx 


22 
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In the case of populations VI and VII the ratio of mean deviation to standard 
deviation was ‘789 and *797 respectively, differing only very slightly from the 
Normal value of V 2 / 7 r * *798. It was therefore not to be expected that the test 
would be efficient in these cases. In the case of populations II and III the ratios 
are 818 and *768 respectively, but even here owing to the small size of sample, 
only 2 out of 6 and 1 out of 6 , respectively, of the sample values of are seen to 
fall beyond the 5 limits. 

The values of V/Si and Qz were also calculated for each of the six samples; for 
these criteria of course no change occurs on rearranging the x'%. The results are 
shown in Table III and plotted in Fig. 2 , where 5®/^^ and 1 ®/^ limits for appro- 
priate for samples from a Normal population have been inserted*. The V test 

TABLE III. 


Values of ^ anid 0% in Samples of 50. 


Hample 

Population 11 

Population III 

Population VI 

Population VII 

No. 



-Jpi 

^2 


P 2 

Jp, 

^2 

1 

-•229 

1-962 

•484 

3-753 

•124 

2-513 

1-086 

3-666 

‘2 

•201 

2-122 

•718 

4-181 

•903 

3-289 

1-131 

3-627 

3 

•215 

2-309 

- -511 

2-517 

•856 

2-984 

•674 

2-915 

4 

•263 

2-282 

- -294 

2*931 

•175 

2 ' 5 vS 0 

•906 

2-934 

5 

-282 

2-332 

• 4.34 

3*054 

•743 

3-203 

1-022 

3-601 

6 

•161 

2-308 

•900 

5-645 

•409 

2-301 

•994 

3-427 

Mean 

+ •149 

2-219 

+ -288 

3*680 

+ •535 

2-812 

+ -969 

3-362 


is reasonably efficient in detecting the skewness of populations VI and VII; for the 
other two cases (II and III) where the / 8 a test would be appropriate, no limits are 
available. Since however for /i'=» 50, = *60 approximately, it is doubtful whether 

significant departures from normality could be established in the case of any single 
sample even if the true sampling distribution were known. It is of interest to note 
that in all but 2 of the 24 samples, ^ 2 . is less than the corresponding population 
value. This bias which appears also in the other experiments will be referred to 
again below. 

Series B. 

A second experiment consisted in drawing six samples of 300 from each of the 
two symmetrical Non-normal populations II and III. In this case the 5®/^ and 1 ®/^ 
limits for >82 previously calculated are probably fairly accurate, so that a satisfactory 
comparison of the >82 and Wn tests is possible. The results are shown in Table IV, 
p. 340, and Fig. 3. 

* These are obtained from my table already referred to, Biometrikat Yol. xxzz. (1980 — 1931), p. 248. 
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The correlation between the two criteria is extremely high*» so that a decision 
based on a knowledge of Wyl is not likely to differ much from that based on and 
vice versa. There is certainly a suggestion as indicated in the following table that 
is somewhat more efficient in detecting lack of normality than but with only 
six samples available the evidence is far from conclusive. The mean values of 
are slightly less and the mean values of slightly more than the population 
values given in Table I, but the bias is far less than in the samples of 50 (Series A) 
and 76 (Series C). 

TABLE V. 


Samples o/300; Comparison of Criteria, 



Population 11 

Population III 


Using jSj 

Using 

Using /3a 

Using 

Number of sample points : 

Within 5 limits ... 





0 

2 

0 

2 

Between 5 and 1 limits 

2 

0 

3 

2 

Beyond 1 limits 

4 

4 

3 

2 


varies of course about zero, but the standard error increases as the population 
becomes more leptokurticf and the values for the Ist and 6th samples for population 
III (+’490 and — •4o6) fall far outside the l7o limits (±*829) for a Normal 
population. This does not however mean that the test should be used in this 
case, since the modal value of Si is zero. 

Series C, 

In this case ten samples of 76 were drawn from each of the populations I, IV 
and V; the populations differ more from the Normal than in the preceding case 
but the samples are considerably smaller. Values of V^, St ^'^d are shown in 
Table VI, p. 343, and the hxst two are plotted in the form of a correlation diagram 
in Fig. 4. In this case no reliable 5°/^ and limits for St are available. As a 
rough indication of their position, however, the procedure adopted in my earlier paper 
has been followed, i.e. the first four moment coefficients appropriate for n'«76 
have been calculated J and the probability levels obtained by quadrature of the 
Pearson Type IV curve having these moments. The results were as follows: 

Lower limits: 1°/^ at 2'14, 6°/^, at 2*27. 

Upper limits: 5% 3*86, 1% 4*67. 

* See Mr Geary’s Section III below. 

t First approximation values of the standard errors of and /Sj, for samples from different Pearson 
type curves may be obtained from Tahle$ for Statisticians and BiometrieianSf Part i. Tables XXXVII and 

xxxvin. 

t The formulae used are (5) to (8) of my paper in BUmeirika, Vol. xxiii. (1981), p. 428, which give 
mean /33=2-922, <r^^0-510, = 1*971, Pj (/53)=7*4e0. The calculations were kindly undertaken by 

Mr K. Williams. 
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Fig ^ Comparison of /S* « < in Samples or 76. 


•88 


•84 


•80 


*76 


•72 


•68 


•64 

1-2 20 28 3-6 4-4 5*2 60 6*8 76 

SCALE OP 



Turning to the diagram it is seen that there is again a high correlation between 
the two criteria and Wn\ which is less intense for the leptokurtic populations 
than for the rectangular. Counts of the points falling beyond dififerent limits leads 
to Table VII on the opposite page, analogous to Table V. In view of the 
uncertainty of the position of the limits, it would not be justifiable to make 
any critical comparison of relative efficiency of the two tests in this case, but there 
is certainly no evidence that the test is less efficient than the ^2 test. 


A noticeable feature of Table VI is the bias both in A and for samples 
from leptokurtic populations. For the case of normal populations this bias can be 

^ 

corrected by the use of multiplying factors (exactly for /9i) and e 


(approximately for Wn\ but these factors will clearly differ for other populations. 
It will be seen that for Population IV the mean of the 10 sample values of ^2 
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TABLE VI. 

Samples q/*76 ; Values of Criteria. 


Sample 

Population I (0*0, 1*8) 

Population IV (0*0, 7-1) 

Population V (0-0, 6*9) 

No. 

s/ft 

ft 


s//*l 

ft 


s/ft 

^2 


1 

+ •090 

1-821 

•865 

+ •190 

4-038 

•757 

-•462 

6*250 

•710 

2 

+ •316 

1-839 

•861 

- 1-426 

6-200 

•736 

+ *443 

6*189 

•675 

3 

+ •282 

1-709 

•876 

-•797 

5-957 

•722 

+ •822 

3-609 

•767 

4 

+ •039 

1-582 

•807 

-•766 

3-937 

•763 

+ •464 

4*441 

•707 

6 

- *232 

1-946 

•844 

+ •576 

6-874 

•734 

+ •292 

5*240 

•686 

6 

+ •198 

1-709 

•882 

+ •663 

3-143 

•781 

-•169 

4-285 

•705 

7 

-•232 

1-933 

•863 

+ •670 

3-334 

•774 

- -680 

4-093 

•721 

8 

+ •315 

1-736 

•881 

+ •033 

6-323 

•697 

-•219 

4-336 

•709 

9 

+ •226 

2-044 

•847 

- -688 

4-350 

•724 

+ •106 

4-148 

} -754 

10 

- -032 

1-772 

•871 

-•442 

3-980 

•720 

- -390 

7-149 

•648 

Mean 

+ •097 

1-809 

•868 

- -208 

4-614 

•741 

+ •021 

4-864 

•708 


TABLE VII. 

Samples o/76; Comparison of Criter^ia. 



I'opnlation I 



Population IV 

Population V 


Using 

Using 

< 

Using /Sjj 

Using 

< 

Using /Sj 

Using 

«>»' 

Number of sample |K)ints : 







Within 5 7^ limits 

0 

0 

2 

3 

1 

1 

Between 5 and 1 limits 

0 

2 

4 

1 

6 

1 

Beyond 1 7o limits ... 

10 

8 

4 

6 

4 

8 


TABLE VIII. 

Samples of 76; Comparison of Criteria, 


Estimate of 
standard error of : 

Population I 

Population IV 

Population V 

Exact values for 
Normal population 

Wn 

•017 

•027 

•036 

•024 


•137 

1-129 

1-106 

•510 

^/A 

•221 

•710 

•456 

•270 
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(4*61) falls short of the population value (7*1) by over 33 The bias in Wn^ 
here in the opposite direction, is relatively of much less importance. In the same 
way, while it will be seen that the standard error of increases as the population 
becomes more leptokurtic, this increase is very definitely less than that in the 
standard error of I^t the reader examine, for example, the sets of 10 samples 
in the Tables on p. 343. The results given there suggest that in samples of 
moderate size w,/ provides a better estimate than of its corresponding 
population parameter. The problems of estimation and of testing the hypothesis 
of normality are not however identical, and much more work is necessary before 
a final comparison of the merits of these two criteria can be made. 

3. Conclusions, 

(1) In testing for departure from normality it would no doubt be desirable to 
have a single test sensitive both as regards skewness and kurtosis. As this is not 
available it is necessary to apply two separate tests. 

(2) To detect skewness the best criterion would appear to be for which 

tables of 5 and 1 probability levels, believed to be sufficiently accurate for all 
ordinary purposes are available f. 

(3) As a test of whether the population sampled is platykurtic or leptokurtic, 
either ^ 2 * or may be used. Out of these there are strong practical grounds 
for choosing Wn ^ the ratio of mean deviation to standard deviation, unless the 
sample is very large when ^2 may be used. The reasons for this are as follows: 

(a) Until further investigation has been carried out, the accuracy of the tables 
of 5 and 1 probability levels for ^2 cannot be established. 

(h) While the sampling distribution of is known and accurate tables have 
been given by Mr Geary, the calculation of this criterion is very troublesome except 
in the case where the population mean is given. 

(c) There are strong grounds for believing that if n' > 40, Wn may be referred 
to the Wn tables of 6°/^ and 1®/^ probability levels. 

(d) The calculation of Wn is straightforward and shorter than that of /Sz; if the 
data are grouped an approximate adjustment to obtain the mean deviation may be 
used J. 

* This bias may be explained as follows. The high value of /Sg for the distribution law of equation (1), 
is due to the very long drawn-out tails. The chance however of obtaining an observation falling in these 
tails in samples even as large as 76 is small, so that the sample /S^’s have a very skew sampling distribu- 
tion with mode well below the population value. The standard error is however infinite. 

t Tables for Statisticians and Biometricians, Part n. Table XXXVII W« for n" = 60, 76, 100... etc. and 
Biometrika, Vol. xxvn. (1934), pp. 269 — 271, for n'=20, 80,... 90, 100. 

{ If the centre of the group containing the mean, x, of the sample be taken as the origin of x, if 
observations fall into the tth group having central value Xf, if tio observations fall into the group con- 
taining the mean and if the mean is at a distance $ from the origin ( - ^ where the unit is the 

group breadth, then we may take as an approximation 

2 / 1 J X number of observations in groups below the origin group} 

- I d X number of observations in groups above the origin group} + wo (i + ^). 
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{e) There is clear experimental evidence that in the case of leptokartic popula- 
tions the sample provides a biased estimate of the population value. This bias 
is far smaller in the case of 

(4) A high negative correlation exists between the sample values of ^ and 
Wn *. This means that in testing the hypothesis of normality, in any given case 
a conclusion drawn from the value of one criterion is unlikely to differ much from 
that which would be drawn from a knowledge of the other. If the uncertainty 
regarding the probability levels of /8a could be removed, it is likely that for large 
samples this criterion would be found somewhat more efficient than in detecting 
departure from normality. This point cannot however be regarded at present as 
established. 

* See Section III added by Mr Geary giving the value for this correlation when the population sampled 
is Normal. 


[Note. The object of publishing the following Appendix containing 72 random 
samples of various sizes from diverse forms of curves is to put into the hands of 
those desiring to test other, old or new, criteria ready-made samples and save them 
from the heavy labour of obtaining afresh their own samples. Ed.] 



APPENDIX. 


FULL DATA CONCERNING POPULATIONS AND SAMPLES USED IN 
A PAPER BY E. S. PEARSON ENTITLED: “A COMPARISON OF ^ 

AND MR GEARY’S w,, CRITERIA.” 

For the purpose of future reference and comparison, it has appeared useful to 
publish in this appendix full details of the populations sampled and the method of 
sampling, and also to give the complete data of the samples themselves. 

In sampling with Tippett s Random Numbers {Tracts for Computers, No. XV) 
the method of procedure described on pp. iv — v of the introduction to the Tract 
has been employed. The frequency distributions of the 10,000 numbers repre- 
senting each population are shown in Table X, except in the case of Population I 
(Rectangular); here the sampling was arranged so that there was an equal chance 
(i.e. of the variable, x, assuming values — 16, ... — 1, 0, + 1, ... + 16. This was 
effected by assigning the numbers 01 — 03 to = — 16, 04 — 06 to 16, 

... 97 — 99 to = -f- 16, omitting the number 00 from consideration. 

In apportioning the 10;000 numbers it is inevitable that difficulty should arise 
in the case of very leptokurtic distributions. This difficulty was discussed in a 
note contributed to Dr le Roux’s paper, to which the reader is referred *. It means 
that the actual populations sampled cannot correspond exactly to those defined by 
the theoretical curves, but from the point of view of the present investigation this 
is of no consequence, since the objective has been to investigate the efficiency of 
various criteria in detecting certain wide departures from normality. 

The frequency constants shown in Table IX have been calculated from the 
actual distributions sampled, i.e. from those given in Table X, without applying 


TABLE IX. 

Frequency Constants of Sampled Populations, 



I 

II 

Normal 

III 

IV 

V 

VI 

VII 

Mean 

0*000 

0*000 

0*000 

0-000 

0*000 

0*000 

0*142 

0-147 

Standard Deviation 

9-62*2 

10*644 

10*004 

5*67.5 

6*455 

3*546 

5*008 

12*370 

Mean Deviation 

8*2.60 

8*629 

7*986 

4*378 

4*757 

2*483 

3*961 

9*860 


o-ooo 

0000 

0*000 

0*000 

0*000 

0*000 

0*701 

0*995 

^2 

1*798 

2*600 

3*000 

i 

4*116 

7*052 

6*884 

3*725 

3*834 


hunnetrika, Vol, xxni pp 168 — 169. 
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any corrections for grouping or abruptness except in the case of the mean deviation 
where the expression given in the footnote i on p. 844 above was used. The unit 
employed in the case of the mean, standard deviation and mean deviation is the 
unit of w shown on the A;>scales of Table X. It will be seen that this varies from 
about f to of the standard deviation; it would, no doubt, have been desirable to 
arrange a finer grouping unit in the case of Populations III — VI, but the data 
were collected from sampling experiments initiated at different times and with 
different objectives. 

The sampling results are shown in Tables XI — XIII; for Experiment A, the 
50 individual values of ^ are recorded for each sample. These values are tabled in 
a completely random order; in fact the order is that used in calculating the first 
of the 4 sets of values of Wn given in Table II. The rearrangements needed to 
obtain the other 3 values of Wn were carried out by writing the 60 a?-values on 
small counters, thoroughly mixing and drawing out randomly in order. 

For Experiments B and C the tables show the frequency of occurrence in each 
sample of the different values of a. Thus in Table XII the columns each total 
300 and in Table XIII they total 76. 

The sample frequency constants tabled in the preceding paper have been 
calculated directly from these figures without further grouping. No Sheppard’s 
Corrections have been applied, either to the second or fourth moments ; the mean 
deviation was calculated from the expression given on p. 344 above 

* Further reflection suggests that if Sheppard’s Corrections are not used, the correction to the mean 
deviation to adjust the contribution from the central group is not really justifiable. The consistent 
course would, no doubt, have been to suppose that x could assume certain discrete values only; the 
resulting alterations to the values of the mean deviation would, however, be too small to be of practical 
signiflcanoe. 
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Appendix to Section II 
TABLE X. 

Grouped Frequencies in Populations Sampled. 



Ponulation freauenoiea for svmmetrioal 

Population frequencies for skew distributions 

II 

/J2 = 2-6 

d 

Normal 

istributio 

lU 

/Js=4-1 

ns 

IV 

ft = 7-1 

V 

VI, v'ft=0-7. 
ft=8-75 

VII, ^191 = 1-0, ft = 

= 3*8 

X 

Frequency 

X 

Frequency 

X 

Frequency 

0 

348 

398 

774 

758 

1812 

-12 

1 

-17 

141 

+ 23 

62 

± 1 

348 

397 

758 

740 

1350 

-11 

9 

-16 

245 

+ 24 

67 

+ 2 

345 

391 

710 

690 

905 

- 10 

37 

-16 

298 

+ 25 

52 

+ 3 

340 

381 

637 

615 

606 

- 9 

98 

-14 

332 

+ 26 

47 

+ 4 

332 

368 

550 

529 

406 

- 8 

197 

-13 

353 

+ 27 

43 

+ 5 

322 

352 

459 

440 

273 

— 7 

327 

-12 

367 

+ 28 

39 

4* G 

311 

334 

369 

355 

183 

- 6 

472 

-11 

375 

+ 29 

35 

± 7 

298 

312 

291 

282 

122 

- 5 

611 

-10 

377 

+ 30 

32 

+ 8 

284 

289 

222 

221 

82 

- 4 

726 

- 9 

376 

+ 31 

28 

± 9 

268 

266 

168 

170 

65 

- 3 

802 

- 8 

373 

+ 32 

26 

+ 10 

251 

242 

123 

130 

37 

- 2 

839 

- 7 

366 

+ 33 

23 

+ 11 

234 

218 

91 

100 

25 

- 1 

834 

- 6 

358 

'+34 

21 

+ 12 

214 

195 

66 

77 

16 

0 

796 

- 5 

349 

+ 35 

18 

+ 13 

196 

171 

47 

58 

11 

+ 1 

734 

- 4 

339 

+ 36 

16 

+ 14 

177 

150 

34 

45 

8 

+ 2 

654 

- 3 

327 

+37 

15 

+ 15 

159 

129 

24 

35 

5 

+ 3 

569 

- 2 

315 

+ 38 

13 

+ 16 

139 

111 

18 

27 

3 

+ 4 

482 

- 1 

303 

+ 39 

11 

±17 

122 

94 

12 

21 

2 

+ 5 

400 

0 

290 

+ 40 

9 

+ 18 

104 

79 

9 

16 

2 

+ 6 

325 

+ 1 

277 

+ 41 

9 

+ 19 

88 

66 

7 

13 

1 

+ 7 

260 

+ 2 

264 

+ 42 

8 

+ 20 

74 

54 

5 

10 

1 

+ 8 

205 

+ 3 

252 

+43 

6 

+ 21 

60 

44 

3 

8 


+ 9 

159 

+ 4 

239 

+ 44 

6 

+ 22 

46 

36 

3 

7 

— 

+ 10 

121 

+ 5 

226 

+ 45 

5 

±23 

37 

28 

1 

5 

1 

+ 11 

92 

+ 6 

213 

+ 46 

4 

±24 

27 

23 

2 

5 


+ 12 

69 

+ 7 

202 

+ 47 

3 

+ 25 

19 

17 

1 

3 

— 

+ 13 

51 

+ 8 

190 

+ 48 

3 

±26 

14 

14 

1 

3 

— 

+ 14 

37 

+ 9 

178 

+ 49 

3 

± 27 

8 

10 

— 

2 


+ 15 

27 

+ 10 

168 

+ 60 

2 

+ 28 

5 

8 

1 

2 

— 

+ 16 

20 

+ 11 

157 

+ 61 

2 

±29 

3 

6 

— 

2 

— . 

+ 17 

14 

+ 12 

146 

+ 52 

1 

+ :io 

1 

5 

— 

2 

— 

+ 18 

9 

+ 13 

137 

+ 63 

1 

±31 



3 



1 

— 

+ 19 

7 

+ 14 

128 

+ 54 

1 

+ 32 


2 

— 

1 

— 

+ 20 

5 

+ 15 

119 

+ 65 

1 

±33 



2 

— 

— 

— 

+ 21 

4 

+ 16 

110 

+ 56 

1 

±34 

— 

1 

1 

1 

— 

+ 22 

2 

+ 17 

102 

+ 57 

— 

±36 



1 


1 

— 

+ 23 

2 

+ 18 

94 

+ 58 

1 

±36 


1 

.... 

— 

— 

+ 24 

1 

+ 19 

87 

+ 59 

— 

±37 


... 

— 

1 


+ 25 


+ 20 

80 

+ 60 

— 

+ 38 





— 

lat*±40 

— 

+ 26 

1 

+ 21 

74 

+61 

— 

+ 39 

— 

1 

— 

lat +47 

— 

+ 27 

— 

+ 22 

68 

+ 62 

1 

±40 

— 

— 


1 at ±58 

— 

+ 28 

1 

continued al>ove 



Totals 

10,000 

10,000 

10,000 

10,000 

10,000 

Total 

10 poo 

Total 

10,000 


N.B. For Population I (Rectangular) see Bi)ecial reference in text. 
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TABLE XL 

Eeeperiment A. Individual Values of x in Six Samples of 50 from Five Populations. 


Population 11 

Normal Population 

Population III 

+ 9 

+ 5 

+ 1 

0 

+ 12 

+ 6 

+ 5 

0 

- 3 

+ 10 

+ 15 

- 6 

+ 

2 

- 6 

+ 

1 

+ 1 

+ 4 

0 

- 9 

+ 11 

-19 

-10 

_ 4 

+ 16 

- 3 

+ 12 

+ 3 

+ 

4 

- 1 

- 

6 

+ 5 

- 7 

+ 8 

- 2 

-12 

+ 16 

- 8 

+ 9 

- 3 

+ 16 

+ 6 

- 4 

+ 

4 

+ 4 


0 

- 6 

+ 12 

-21 

+ 2 } 

+ 7 

+ 10 

- 8 

- 4 

-16 

+ 11 

+ 17 

+ 10 

- 

8 

- 2 

— 

1 

-f 5 

+ 22 

+ 17 

-15 

+ 11 

-13 

- 8 

+ 15 

-28 

+ 4 

0 

+ 4 

+ 

3 

+ 1 

— 

2 

4-16 

+ 18 

-12 

-16 

- 8 

- 7 

+ 9 

+ 6 

+ 16 

+ 16 

+ 3 

+ 3 

+11 

- 3 

— 

7 

-16 

+ 10 

+ 26 

+ 15 

- 14 

-19 

+ 22 

0 

- 7 

+ 13 

+ 9 

- 6 

+ 

4 

- 5 

- 

5 

-13 

- 6 

-13 

-22 

- 3 

- 6 

^ 2 

-16 

+ 2 

- 1 

+ 6 

+ 1 


0 

0 

+ 

4 

+ 12 

+ 1 

+ 10 

+ 2 

+ 7 

- 9 

+ 12 

0 

+ 1 

- 9 

+ 2 

~ 3 

— 

6 

+ 3 

- 

1 

-18 

-10 

+ 6 

- 6 

+ 5 

+ 9 

+ 8 

- 6 

- 3 

+ 1 

+ 7 

-10 

— 

4 

+ 6 

+ 

6 

+ 13 

- 7 

+ 1 

- 5 

+ 2 

+ 4 

- 5 

- 9 

0 

+ 1 

+ 3 

0 

- 

3 

+ 4 

- 

1 

+ 19 

+ 6 

-22 

+ 1 

-13 

0 

+ 4 

- 2 

0 

+ 4 

+ 11 

+ 10 

- 

3 

+ 9 

— 

6 

- 3 

- 1 

+ 19 

+ 13 

- 3 

- 3 

- 7 

+ 16 

+ 18 

-15 

-11 

+ 1 

+ 

7 

+ 7 

— 

2 

+ 10 

-22 

+ 13 


+ 1 

+ 6 

0 

+ 11 

- 5 

+ 12 

+ 9 

- 6 

— 

3 

+ 2 

- 

10 

+ 15 

+ 16 

- 9 

-19 

-15 

+ 4 

+ 14 

- 4 

+ 2 

-18 

- 9 

-13 

— 

7 

+ 4 

+ 

3 

+ 1 

-10 

0 

- 3 

-14 

+ 4 

- 3 

-17 

- 2 

+ 10 

- 6 

- 5 

+ 

3 

+ 10 

- 

2 

-16 

+ 6 

+ 2 

- 8 

+ 20 

+ 9 

- 5 

- 4 

- 6 

+ 21 

+ 14 

- 7 

- 

1 

- 4 

— 

1 

- 5 

-23 

-16 

- 4 

- 1 

+ 23 

-12 

+ 2 

+ 3 

+ 9 

- 1 

-12 

— 

8 

-13 

- 

1 

+ 2 

+ 9 

- 4 

+ 15 

- 13 

-10 

-12 

+ 5 

-13 

-10 

- 2 

-12 

+ 

6 

- 8 

— 

1 

+ 16 

+ 3 

- 2 

+ 9 

0 

- 9 

+ 13 

- 4 

-21 

-14 

+ 15 

+ 8 

— 

2 

+ 9 

+ 

6 

+ 15 

-10 

+ 2 

0 

+ 15 

-14 

-27 

- 3 

-14 

- 7 

-12 

+ 11 

- 

2 

- 7 

+ 

6 

+ 10 

- 3 

- 2 

- 5 

+ 15 

- 1 

- 7 

+ 13 

+ 12 

+ 6 

+ 1 

_ 4 

+ 

2 

- 4 

- 

9 

-11 

-11 

-19 

+ 5 

- 3 

-17 

-10 

- 5 

+ 3 

+ 2 

- 3 

- 9 

+ 

3 

+ 12 


0 

+ 4 

- 1 

+ 17 

+ 3 

0 

-12 

+ 8 

- 6 

- 1 

-10 

- 8 

+ 4 

- 

5 

+ 2 

— 

12 

+ 1 

- 7 

+ 1 

-19 

+ 20 

-10 

0 

- 2 

+ 4 

- 7 

- 6 

+ 1 

+ 

1 

- 2 

— 

8 

-13 

- 11 

- 8 

- 2 

0 

-16 

- 7 

- 2 

+ 8 

- 3 

- 9 

-11 

- 

8 

+ 3 

— 

6 

-18 

- 6 

-14 

- 7 

-11 

- 8 

- 2 

+ 14 

+ 6 

-12 

- 2 

-13 

— 

7 

- 1 

+ 

6 

+ 7 

+ 11 

- 7 

+ 11 

+ 26 

0 

-13 

+ 1 

- 2 

- 7 

- 3 

+ 3 

+ 15 

- 1 


0 

-10 

- 4 

+ 8 

- 6 

- 9 

- 6 

+ 2 

+ 5 

+ 8 

-22 

+ 2 

+ 9 

- 

2 

- 1 

- 

10 

- 8 

-22 

-23 

+ 18 

+ 16 

+ 14 

+ 1 

- 7 

+ 1 

- 4 

0 

- 1 

— 

1 

+ 1 

+ 

7 

-18 

- 2 

- 9 

-14 

- 6 

- 9 

- 4 

- 6 

- 6 

+ 1 

-16 

- 3 

- 

2 

- 2 


0 

-25 

+ 20 

-11 

+ 13 

+ 3 

- 3 

-12 

- 6 

+ 19 

+ 9 

+ 12 

+ 3 

— 

1 

- 8 

+ 

4 

+ 13 

+ 7 

- 11 

- 4 

+ 11 

+ 9 

+ 20 

+ 22 

- 2 

- 4 

- 5 

+ 14 


0 

- 6 

+ 

2 

- 7 

-17 

-10 

- 4 

+ 2 

- 8 

+ 3 

0 

+ 3 

- 2 

+ 1 

+ 5 

+ 

3 

- 2 

+ 

2 

+ 19 

- 3 

- 6 

-13 

+ 19 

+ 3 

+ 14 

+ 7 

+ 8 

+ 17 

+ 11 

+ 15 

— 

7 

-11 

+ 

4 

-18 

-12 

+ 6 

+ 14 

-18 

+ 2 

+ 10 

+ 6 

-12 

+ 3 

+ 3 

- 3 

~ 

4 

+ 1 

+ 

4 

+ 10 

- 8 

0 

+ 3 

+ 3 

+ 11 

+ 7 

+ 9 

-24 

- 4 

+ 10 

0 

- 

1 

+ 8 

— 

4 

+ 6 

-12 

+21 

- 2 

- 2 

+ 10 

0 

- 3 

+ 6 

- 12 

-14 

- 2 

— 

1 

- 6 

+ 

3 

+ 17 

- 5 

-19 

-14 

- 2 

+ 9 

- 2 

- 1 

- 1 

- 1 

-21 

- 2 

— 

1 

- 7 

+ 

3 

- 4 

+ 13 

- 2 

-14 

- 4 

+ 2 

-13 

+ 21 

+ 1 

+ 7 

-17 

+ 20 

— 

1 

- 8 

+ 

4 

+ 6 

-14 

+ 11 

- 8 

- 2 

- 7 

+ 8 

+ 10 

+ 16 

+ 9 

+ 15 

- 1 

+ 

3 

- 3 

- 

2 

+ 9 

-12 

+ 16 

-12 

- 4 

+ 16 

+ 7 

+ 3 

- 2 

+ 7 

+ 10 

- 4 

+ 

3 

-11 

— 

5 

+ 11 

-13 

+ 2 

- 5 

- 6 

-12 

- 8 

- 6 

- 7 

-11 

- 9 

+ 17 

+ 

2 

+ 2 

+ 

2 

-19 

+ 7 

+ 19 

- 18 

+ 10 

— 7 

+ 9 

+ 16 

+ 6 

0 

- 2 

+ 3 

-11 

+ 23 

+ 

3 

- 8 

-13 

+ 2 

- 3 

+ 12 

+ 4 

— 7 

- 8 

-15 

- 8 

- 6 

- 5 

+ 

1 

+ 8 

— 

6 

- 2 

-16 

-13 

0 

+ 14 

- 1 

+ 30 

+ 6 

- 2 

+ 2 

+ 8 

- 9 

- 

3 

- 3 

+ 

2 

- 3 

+ 13 

- 9 

-23 

- 2 

- 6 

+ 2 

-11 

+ 20 

- 8 

- 9 

+ 3 

- 

2 

- 6 

+ 

4 

- 3 

+ 11 

- 7 

+ 20 

-13 

+ 15 

-10 

+ 14 

-16 

+ 8 

-10 

-14 

+ 

3 

- 4 

- 

1 

- 8 

-10 

- 3 

+ 16 

- 2 

0 

- 7 

-16 

+ 11 

- 8 

- 5 

- 9 

+ 

9 

- 8 

- 

3 

+ 6 

+ 13 

+ 14 

-19 

- 1 

+ 6 

+ 3 

+ 7 

- 1 

+ 1 

+ 26 

+ 9 


1 

-14 

+ 

4 
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TABLE XI (continued). 


Population 1X1 

Population VI 

Population YU 

+ T ) 

_ 

1 

+ 

1 

_ 

6 

- 1 

_ 

4 

+ 4 

+ 1 

- 2 

- C 

-12 

0 

- 6 

+ 27 

- 8 

0 

+ 

4 

+ 23 

- 

4 

4 - 3 

+ 10 

+ 9 

0 

- 2 

0 

- 5 

-12 

-11 

+ 1 

+36 

+ 1 

— 

3 

— 

2 

— 

6 

0 

— 

6 

- 6 

- 5 

+ 7 

-17 

+ 21 

- 6 

- 3 

- 8 

+ 6 

-12 

+ 

1 


7 

+ 

5 

- 3 

+ 

7 

- 2 

_ 2 

+ 1 

- 9 

+ 11 

+ 20 

+ 4 

+ 10 

- 3 

+ 4 

+ 

6 


3 

— 

6 

+ 3 

+ 

9 

+ 2 

0 

- 2 

+ 11 

-16 

-17 

- 9 

- 7 

-11 

+ 1 

- 

6 

- 

1 

— 

2 

- 6 

- 

3 

+ 11 

- 6 

- 1 

-14 

- 5 

+ 9 

- 8 

+ 9 

- 3 

- 6 

+ 

1 

— 

2 

+ 

3 

- 7 

— 

5 

- 2 

+ 2 

- 2 

*11 

- 4 

-12 

+ 1 

+ 2 

-15 

- 5 


0 


0 

— 

11 

+ 1 

-fll 

0 

+ 6 

+ 1 

+ 11 

+ 36 

+ 4 

-10 

+ 17 

+ 9 

- 4 - 5 

— 

3 

+ 

3 

+ 

3 

- 6 

+ 

3 

- 6 

- 7 

+ 2 

- 2 

-15 

- 4 

+ 20 

+ 7 

+ 33 

- 3 

+ 

1 

+ 

4 

+ 

1 

+ 10 

+ 16 

- 3 

- 6 

0 

- 8 

+ 34 

+ 3 

-12 

- 8 

+ 4 

- 4 

+ 

7 

+11 

~ 

2 

- 6 

+ 

3 

- 1 

- 1 

0 

+ 19 

+ 19 

+ 19 

+ 6 

-11 

- 9 
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TABLE XII. 


Experiment B. Frequency of Different Valuee of x in Samplee of 300 . 
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Ai^endix to Section II 
TABLE XIII. 

Experiment G. Frequency of Different Values of x in Samples of 76. 



III. 

NOTE ON THE CORRELATION BETWEEN AND m;'. 

By R. C. GEARY, M.Sc. 


In the present state of our knowledge, the problem of determining which of 
two tests of normality is the more efficient, can be dealt with only by numerical 
methods. Having obtained the probability points for the rival tests for normal 
samples of different sizes, we should calculate the values of each of the test functions 
for a large number of distributions actually found in practical work and ascertain 
which of the tests gives the greater number of values lying in the respective regions 
of improbability for normal samples. As between the ^2 and w* tests*, investigation 
must be confined to samples so large that one may have confidence in the probability 
points for ^2 fbr normal samples, determined by E. S. Pearson. 

Dr Pearson's discovery, illustrated on Figs. 3 and 4 of his paper, of marked 
statistical relationships between 02 and w' in samples drawn from different parent 
universes, has obviously an important bearing on this problem. If there were a 
rigid algebraical relation between the tests, they would be equally efficient. The 
analysis which follows is subject to the qualification that the statistical relation- 
ship may be closer than the coefficient of correlation indicates. The coefficient of 
correlation — 1 will be found only when the relation is linear. Fig. 4 suggests that 
the relationship between 02 and w' for samples of given size from a given parent 
population may approximate rather to a curvilinear type. 

The only universes for which an algebraical relation subsists between 02 and w* 
are those in which there are two categories only — measuring say 1 and 0. Suppose 
that the sample proportions are p (measuring 1) and q (measuring 0). Then 

mx — p> 


W4=pg(l — 
d — 2p9. 

w' »« dl*^r}fi2 — 2 VjD^, 

02 « m4/ma“ l/pg — 3, 
ySa — 4/t^;'* — 3. 

Accordingly as w' increases 02 diminishes. 

In large samples the sets of values of the elements which minimise 02 maximise 
w\ and vice versa. If the represent the deviations from the sample mean, then 

and «;'= . 


The mean deviation 
Then 

and 
so that 


02 '- 


* [At the desire of Mr Qeaiy I have left w* standiiig throughout this section, although in the first 
section of his paper he has invariably used it with the appropriate subscript, here dropped. £d.] 
Biometrika xxvu 
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If the sample is so large that the y*s may bo considered independent*, the 
minimum value of namely 1, is found when 

which gives the maximum value 1 of w\ The maximum value of namely n\ 

and the minimum value of w' are found for the following series of values of 

Vn' 

the y’s : 

yf+ 0, 

« 0 (A? + i). 

E. S. Pearson’s Table I clearly indicates a tendency in the seven universes (to 
which the normal with ySa* 3 and w' — *798 might be added to give an eighth) for 
a high /Sa to accompany a low and vice versa. 

The correlation between ySa and w' for normal samples may be determined as 
follows. If the sample of a' = -f 1 is 

^ > • • • » ^n' > 


then 

and 

with 


, 2 I ic,: — a? I _ / n -h 1 d 

Va(n4-l)s ^ n s' 


^2 


^(nH- 1) 

__ 4- 1 ) S I — jg 1^ ^ 


n^s^ 


m 




5* — 2 \xi — ® I*, 


Indicating by sijuare brackets, [ ], universal means about fixed origin, the 
coefficient of correlation required is 

The values of [/9a] aiid are known f: 


[A]* 


3n 

71 -h 2 ’ 

Since w' and s are independent, 


= 


24 (a 4- 1) (7i — 1) {ii — 2) 
(n 4- 2)* (n 4- 4) (ti + 6) 


+_i L + _ 

with [s] = ® 4n 20n*'*'ll3nT ^ 

The last of these formulae is due to HelmerbJ. The value of will be found from 


* A solution of the problem of determining the sets of values of the n' variates which give limiting 
values of /9a and w* for all sisses of samples, and the respective limiting values, would have some 
mathematical interest. 

t Beadily derivable from formulae given by R. A, Fisher, Proc. Royal Society, Idmdon, Series A, 180 
(1981), p. 25. :J: Astronomieche Na4ihrichten, Bd. 88, No. 2096 (1876). 
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There remains the calculation of 

(n4-l)(w4-8) 


) M ’ 


in which 

Also 

with 






w- 


[dm*] - {(« + 1 ) [yi*] + n (w + 1) [yiV»]}. 

2| and y*>=|a;, — S|. 

The distribution of ^ is 

i e " Syi (oo > yi > 0), 

with o.i=__^ 

n + 1 

so that [yi»] = 8<r* v/ - • 

▼ JT 

The joint distribution of yi and ya is 

- a,,(iir7j<y.*+y»’+2w,yj) ^ . 

TTCT* Vl — p* ( J 

The value of p, the coefficient of correlation between a‘i — x and is — ~ . 

"fit 

From this distribution it has been found that the value of [yi^i] is 

All the data are now available. Upon reduction the coefficient of correlation r 
between v/ and /Sa for normal samples takes the simple form 

^ (»-l)(n~2) 




TT [s] n (n + 2) (n + 3) * 

The values of [s], and <r/s, in the denominator have been given above. The 
presence of the factors n — 1 and n — 2 in the numerator might have been 
anticipated. If either were missing, r would assume the impossible value 

00 in samples of 2 or 3, since the factors Vw — 1 and Vn — 2 occur in in the 
denominator. In samples of 2, of course, both /8a and v/ assume the same value 

1 for all samples, while /8a assumes the same value f for all samples of 3. 

When n tends towards infinity the coefficient of correlation tends towards 

1 


= -•7673 


Vl2(7r-3) 

The values of r for certain sizes of sample are as follows : 


Sioe of Sample, n 

0 

11 

26 

61 

101 

Coefficient of Correlation, r 


-•851 

- ‘809 j 

- *790 

-•778 








ON THE VALIDITY OF A CERTAIN PEARSON^S FORMULA. 

By JAN WIiSNIEWSKI. 


§ 1. In the memoir “On the Measurement of the Influence of broad Categories 
on Correlation** printed in Vol. ix. of Biometrika (pp. 116 to 139) Karl Pearson 
makes some statements which seem to need qualification. 


The notations are as follows: variables /r, y\ the class-marks of the correspond- 
ing class intervals Oy, The first statement we shall consider is the assumption 
that in each class-interval the actual arithmetic average of the variable or y is 
ef|ual to the corresponding class-mark Cx or Cy. Of course, this is an approxima- 
tion only and within its degree of accuracy only holds the ensuing formula (i) of 
the paper referred to: 


f'xCx — 




(i). 


It is more than sure that Karl Pearson was aware of the approximative character 
of form (i) when proposing it; we merely point to this circumstance in connection 
with the numerical example that will follow. 


The main point of the present note is to show the limited validity of another 
formula of the memoir discussed, viz. form (ix): 


(ix) is based upon (iv) and (viii), 


’’^XOx'^yGy 


(ix). 


(iv). 


== 


'*'OyOx 


(viii). 


The derivation of (iv) is the following (to quote Pearson’s words): “...since a given 
IV will have a constant class-mark, the correlation of y and Gx for a constant x is 
zero; that is to say that the partial correlation coefficient 


viPyCx 


'^yCx 


Vl - 


Vl — 't^xOx 


= 0 ** 


.(a). 


§2. Formula (a) and its derivation need a thoroughgoing discussion. First, 
we shall make some observations concerning partial correlation in general. Accord- 
ing to the writer’s view, partial correlation may be given two interpretations: the 
“layer” interpretation and the “projection” interpretation. 


First, we shall present the more usual “layer” interpretation. Suppose we are 
dealing with the variables w, v, U The points having coordinates U are 
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distributed in a three-dimensional space. Now if we want to hnd the partial 
correlation between u and v we divide the space into layers by means of planes 
parallel to the u, v plane. In each layer the variability of t is greatly reduced so 
that we consider it negligible. Within each layer we make a projection of the 

Vi, ti points on a plane parallel to the u, v plane and calculate the correlation 
found for the distribution of the projected points (of course the projection is made 
perpendicularly to the said plane). At last, we compute a kind of average* of 
the correlation coefficients in each layer and this is the partial correlation coefficient, 
provided the regressions of on ^ and of y on ^ are linear. If they are not strictly 
so, the partial correlation coefficient calculated according to the familiar formulae 
is at any rate an approximation to the coefficient calculated in the manner 
described above. 

The “projection** interpretation is arrived at from the former when we assume 
that layers are made thinner and thinner. Instead of making a projection within 
each layer we make one of all Ui, Vi, U points on the u, v plane. But the projection 
is no more perpendicular to this plane. It is made in such a manner that the 
coordinates of each projected point are Ui — Hi, Vi — Vi where Ui and Vi are values 
obtained from regression equations f on t. If we assume those equations , to be 
linear, we again get the classical formula (1 ^*oe) equivalent 

to the simple correlation coefficient calculated for the projected points. If ^ is a 
discrete variable and the layers are made so thin as to comprise only one value of 
t, the “layer** and the “projection** interpretations are identical. 

§3. Now we can analyse the special case of xpyOx' The distribution of the 
yi> Oxiy points is very peculiar indeed. Assume that x and y are continuous 
variables, i.e. they can take any given value within the field of variation. Cxy on 
the other hand, can take only special values. Moreover, Gx is a definite function 
of X, viz. 



where [z\ means the largest integer smaller than z, and h is the class-interval. 

For this reason all yi, Cxiy Xi points must be situated on plane “stripes** 
parallel to the x, y plane. Each “stripe** is bounded by two parallel straight lines 
which are determined as follows: 

Cx — Cxi ; X ■« Cxi + i A and Cx = Gxi\ x « Cxi — JA. 

The “layer** interpretation fails in this case completely. As a matter of fact it is 
only natural to take /r-layers identical with its class-intervals. Then within each 

* This kind of average resembles what Irving Fisher terms “aggregatio average.” In our case, 
denoting with and Vjg the arithmetic averages of u and v for the layer A;, we have 

^ ^ n/zU (m. - uu)* SS (V - ‘ 

ki • hi 

t In general, these equations need not be perfectly fulfilled. 
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layer all points will lie on a straight line parallel to the y-axis. This means that 
the correlation coefficient for each layer becomes indefinite and so is the partial 
correlation coefficient calculated according to the “layer’* interpretation. 

We would obtain the same result from the “projection” interpretation if we 
assumed that the values of Gom are determined from equation (6), which is really 
the case. By is meant the value of obtained from a regression equation. 

It seems that the only way to get for xPyOjc ^ definite value is to start from the 
“projection” interpretation and to assume 

^xi “ '^xCx ~ “ 1 " ^ Ox 

where Mx and Mcx denote the arithmetic averages of x and (7* respectively. As 
we stated, each y*, Cxiy opi point must lie on a “stripe” conforming to the descrip- 
tion given above. More particularly, for a given all points lie on a straight line 
which is defined CsssCxi as determined from (6), Such a straight line 

is, of course, situated on a given “stripe” and parallel to its boundaries. What will 
be the projection of the points lying on such a line, on the y, Cx plane? The 
projections will have a common Cx coordinate equal to Cxi “-Cxi, where Oxi is 
determined from (c), and y’s eqiial to yi — yit where is found from the regression 
equation of y on x. As Cxi — Cxi takes different values* the projected points will 
not lie on one straight line parallel to the y-axis (as in the “layer” interpretation 
case). If y is not a one-valued function of x, the projected points also will not lie 
on one straight line parallel to the Ojp-axis and, consequently, xpycx will take a 
definite value. If the regression of y and x is strictly linear, the mathematical 
expectation of y** — for each Cxi — Cxi is zero and therefore xpyCx will equal zero. 
If the regression is only “practically” linear, as is assumed by Pearson, equation (a) 
still will give a good approximation. 

The conclusion is that (iv) is true provided the regression of y on a; is linear. 
This is a sufficient condition but not absolutely necessary. We can fancy cases 
where the regression in question is not strictly linear and yet »pyOx = 0. 


§4. The derivation of (viii) is similar; namely, Pearson assumes that the 
correlation of Cx and Gy for a constant y is zero and, accordingly. 


yPcxOy = 


'^OxOy ^ '^yCx'^yOy 


0 


id). 


In order to examine the validity of (d) we shall start from the “projection” inter- 
pretation and assume that Cxi and Gyi are determined from equations analogous 
to (c). ® 


The distribution of the Gxit Cyi, yi points in space is still more peculiar than 
in the foregoing case. Each point lies on a rectilinear segment parallel to the 
y-axis and limited hy two points, having the coordinates Cx^Cxtl Cy^Cyi\ 

* If X is^a continuous variable is also one. For practical purposes we may assume, however, 

that \ Cxi-C^\ch. 



Jan W^ibwski 


859 


y^Oyi±\h (sapposiiig the class-intervals of x and of ^ to be the same). For a 
given Gyi all such segments lie on a plane ** stripe” analogous to those described in 
the preceding paragraph. The difference is that in the former case the points lay 
distributed more or less evenly throughout the stripe while in the present one they 
can occupy only determined positions on rectilinear segments parallel to the 
y-axis. 

Correspondingly the projection of the Cyi, yt points on the C*, Cy plane 
will differ from that of the former case. Let us denote, for the sake of simplicity, 


Ui » Cyi T/yi saa Cyi M ~~ (y* — (^)» 

=* Oxi — Cgfi VyOg {yi — My) (/*), 

from which follows 

Me, + K - (C^ - Mo,)] {g). 


{g) is evidently linear with respect to m. For the whole stripe we have a constant 
Cy and for each segment within a stripe a constant <7*. Therefore the projection 
of the segments of a stripe will be a set of segments parallel to each other and 
having the same field of variation of u (because the field of variation of y is the 
same for all segments, viz. from Cy — to Cy -f ^h). As VyCy is positive and near 
to unity, the angular coefficients of the projected segments will depend mainly 
on ryo*. 
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Chart 1. Distribution of observations from a numerical example. 

Data projected according to equations («) and (/). The dots on this chart do not number 75, as they 
should, because some of them represent several observations. 
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All segments of our distribution will also give a set of parallel segments*, but 
the segments taken from different stripes will have fields of variation of u of the 
same length, to be sure, yet somewhat shifted — suppose both Cyi and in ie) 


increase by h, then t/* will increase 


byA(l-r,c,^''). 


What will be the correlation calculated for the projected distribution? Off-hand 
we cannot say anything. If the points were distributed evenly in each segment, 
the correlation would surely follow the sign of the absolute value of p 

approaching zero as the number of segments for each Gy would increase. But the 
premise given above is not general; indeed, if we really have to do with “broad 
categories,” we might expect that the correlation between y and (7® will be 
observable already within given ^/-intervals, i.e. for a given Gy, The coefficient of 
correlation for the points thus projected is evidently ^ 


ypCyCx 

This cannot be zero except when 
or 


"^OyOx '^yOy'^yOx 


^ VOx — '^CyCx ^ 


^yCx 


'f'yCy = 0 


ih\ 

,{k). 


Thus wc are brought back to our starting point, (k) is clearly equivalent to (viii). 
Therefore we see that Pearson’s contention (viii) cannot be generally proved in an 
analytical way. In the best case we can regard (viii) as an approximation of 
unknown precision. The application of the reasoning used at the end of §3 is here 
impossible as the regression of Gx on y will noty in general, be strictly linear; 
consequently, the mathematical expectation of Gxi — Gxi for a given Gyi — Gyi will 
not necessarily equal zero. Especially, if some observations are distributed in such 
a manner that in a certain number of y-rows one cell only is filled in each, the 
aforesaid mathematical expectation cannot be generally zero. In such cases the 
sign of (h) will surely follow that of ^CyCx- 

The above conclusions can also be derived from a discussion of (k). If the 
distribution is such as just described rg^Ox should nearly equal ryOx> On the other 
hand, we may expect that in such cases the classification is really done in “broad 
categories,” thus the value of Vyg^ will bo comparatively small and the left-hand 
side of (k) will be positive instead of zero. We may also resort to the graphic 
presentation given in Chart 2. Suppose in some y-rows one cell only is filled in 
each, then this will be surely that cell through which passes the line of regression 
of Gx on y, this regression being assumed linear. The points of this cell, when 
projected, will lie on a segment passing near to the (0, 0) point. If the bulk of the 


* Such a projection is shown in Chart 1. It can be clearly seen that the projected points form 
a set of parallel segments. [Surely any reasonable test for outliers would throw out the five points to the 
left and the single point to the right of Chart 1 ? Ei>.] 
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observations belongs to such cells we shall have a picture like that in Chart 2— 
viz. a great mass of points situated on a narrow stripe passing near to the origin 
of the system. Those points will clearly contribute to produce a correlation coeflS- 
cient of large numerical value and of the sign of rc^Ox- other hand, the 

majority of points belong to other cells, their projections will give a picture of the 
scattered points in Chart 2, those points contributing to lower the numerical 
value of ypcycx- 



Chart 2. Distribution of observations from the numerical example. Original data. 


It should also be noticed that our conclusions would be unchanged if instead of 
(e) we simply took Ui Oyi — y*. 

§6. To sum up: Pearson’s formulae (viii) and (ix) are not generally true, and 
their degree of approximation is unknown. How serious blunders can arise, in 
extreme cases, from their application, the following example will show: 
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Numerical Example, We will study the distribution of rural districts in a 
certain Polish county according to the number of dwelling houses {SUjLtistiqus de 
la Pologne publi^e par TOffico Central de Statistique de la R6p. Polonaise, S4rie 
B, fasc. 8-b, pp. 11 — 12. Arrondissement de Chodzie^, communes rurales. Nombre 
de b&timents d’habitation). a;-data from 1921 census/y-data from 1931 census. The 
original data are given in the table below : 


X 

y 

X 

y 

X 

y 

X 

y 

X 

y 

2 ;) 

22 

5 

6 

33 

30 

19 

17 

36 

35 

41 

42 

66 

67 

62 

64 

40 

40 

30 

31 

52 

60 

41 

44 

41 

42 

28 

28 

21 

26 

30 

30 

66 

76 

8 

9 

83 

82 

31 

33 

24 

24 

13 

13 

37 

32 

73 

70 

114 

135 

83 

84 

18 

18 

120 

134 

64 

64 

30 

31 

34 

36 

20 

20 

34 

31 

66 

67 

105 

118 

20 

22 

9 

9 

26 

27 

46 

60 

13 

13 

37 

41 

20 

20 

32 

27 

30 

33 

70 

72 

20 

18 

28 

30 

176 

172 

46 

50 

33 

32 

10 

9 

8 

8 

44 

43 

105 

107 

66 

66 

91 

96 

66 

64 

67 

60 

99 

98 

24 

23 

61 

62 

61 

59 

6 

6 

60 

47 

126 

122 

32 

32 

166 

179 

43 

41 

24 

21 

92 

89 

126 

116 

16 

16 

47 

42 

20 

20 

21 

21 


The data are also charted and given in graphic form (Chart 2). The cross-classifica- 
tion has been executed in intervals of twenty houses, including the upper class-limit. 
The figures given in the graph are the frequencies in the several cells. 

The values of the parameters of the distribution are: 

X Ca, y Cy 

Arithmetic average M 47*93 48*10 48*71 48*63 

Standard deviation <r 35*92 36*66 37*45 36*84 

N^lb, r*y»*9934, = ‘98’79, r^e* = -987l, rj,Oy = *9877, Vyo^^-dSlO. 

All r*6 are computed directly, i.e. making no use of the quoted Pearson's formulae. 
Now in the light of this example we shall test the validity both of the formulae 
mentioned and of the limitations we suggested. The particular example we chose 
seems to be well adapted for this purpose. 

Mge and Mc^, differ widely, so formula (i) cannot be expected to give good results. 
Indeed, r\of actually comes out greater than 1! On the other hand, the 
difference between My and Mq^ is rather slight and correspondingly •r^yOy « *9837, 
which is not far from the value calculated directly. 

As the regression of y on a; is very nearly linear, form (iv) should give correct 
results. As a matter of fact, *9938. This discrepancy between this value 

* We denote by r* those coefficients which are calculated indirectly, i.e. by means of Pearson’s 
formulae. 
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and the one given above is within the margin of error due to rounding off 
decimals. 


Because of the existence of many y-rows containing only one cell that is not 
empty, formulae (viii) and (ix) must register foilures. Thus apoyo. instead of zero 
equals *626, r^J{rga,ryo^) > 1 ; royaji/mcg^^vo,) “ *9919 which happens to be in the 
right direction, but we wonder whether one would not hesitate to use r'’s greater 
than unity. 

Lastly, we shall test Pearson’s formula (xv) devised for the case when, as 
usually, we know very little about the distribution within class intervals. 


“ »Ox 


1 


12aa« 


(XV), 


where h is the class-interval. Here again ro^J(y'xOx^"vOy) > 1- 

One might object that the differences between theory and application are so 
minute as not to deserve attention, but the corrections to crude coeflScients as 
proposed by Pearson are themselves, in general, of a rather small order. 



ON THE CORRECTIONS FOR BROAD CATEGORIES, BEING 
A NOTE ON MR WISNIEWSKI’S MEMOIR. 

By K. PEARSON. 

I PROPOSE to discuss Mr Wisniewski's criticism of my method of correcting for 
broad categories, by dealing first with the numerical example which he provides. 
I may state that I consider that example a very extreme one to test the accuracy 
of my corrections upon, and the reasons for this will appear in the course of my 
analysis. 

Mr Wisniewski s data consist of the numbers of inhabited houses in certain 
rural districts or arrondissements in Poland at the date of tw^o censuses. There is 
no district without a house, the lowest number being five, and the distributions 
at both censuses are crowded into the ranges 5 to 60. Thus the frequency distri- 
butions of the two variates are both extremely skew, we are ignorant of where 
they start, and the sample consists of 75 rural districts only, so that the standard 
errors of the constants involved are very large, and no allowance is made for anj’^ 
of these peculiarities. The case is, however, said to be singularly suitable for 
testing my formulae. 

In Table I the actual data are given, where I must remark that the groups 
of 20 houses are included in the group 0 to 20, and the group 20 to 40 does not 
include the 20 and does include the 40 and so on. 

Hence, if this discrete series is to be replaced by a continuous variate we must 
take our groups to be 20*5 to 40*5 and so on. Mr Wii^niewski does not appear to 
notice that my formulae are deduced on the assumption of continuous and not 
discrete variables. Even when we replace discrete by continuous variables the 
start of the continuous variables is left in the air. Now I show in my original 
paper that iny correction for the correlation coefficient reduces to Sheppards 
correction, when the sub-ranges are all equal, as they are in this case (except the 
doubtful first category) and when we accept the hypothesis of Sheppard, i.e. that 
high co'ntact at the terminals of the distribution holds. Now in Mr Wisniewski's 
data there is no high contact at the zero ends of the frequency distributions, but 
on the contrary great abruptness. Hence the standard deviations involved in the 
value of the correlation must be corrected in a manner different from that involved 
in taking 



a relation which depends on the high contact hypothesis. It is overlooking this 
point which leads him to obtain a corrected correlation coefficient slightly over 
unity. 
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Taking the data as given in Table I and working with the 75 individual cases, 
I find, a being the horizontal and y the vertical variate, 

Means W - 47-933,333, ^ « 48*706,667, 

Standard Deviations 

= 36-914,466, o-,, - 37-464,959, 

Product Moment CoeflScient» 1336*353,777, 
providing the correlation coefficient 

-993,442. 

Mr WiiSniewski gives the values 

^ = 47-93, 48*71, 35*92, = 37*46, r^«-9934, 

which nearly agree with the above values, although they are taken to an inade- 
quate number of figures having regard to later work. 

Now the problem before us is to obtain these results, or results close to them 
from a contracted table, i.e. one of broad categories, and so save the tedious process 
of dealing with the individual 75 entries in computing the coefficients. But the 
corrections for grouping observations together have hitherto been based on con- 
tinuity of the variates, except in those proposed by H. C. Carver for grouping 
discrete variates and these fail in the present case, as we shall see, precisely as 
Sheppard’s fail, and for the same reason — they disregard the abruptness. In order 
accordingly to correct the contracted table we are bound to suppose the frequencies 
represented on it result from the sampling of a continuous distribution. In the 
next place we are at liberty to choose our grouping, and the safest grouping 
experience shows is that which starts close to the lowest observation in the data ; 
in this case 5, 5. Let us take our grouping accordingly in sub-ranges of 20 from 
4*5. We then reach Table II. 

TABLE II. 


x-variate 



J 


>0 

T 


S' 

a 

3- 

7 

2- 

’-i 

1 

>-i 


Totals 


21 









21 


1 

23 

1 

— 

— 

— 

— 


— 

2.5 

J^'B — 04* B 

— 

— 

13 

— 

— 

— 

— 

— 

— 

13 

64'B~-~84'B 




5 

— 

1 


— 

— 

5 

84^B—li)4'B 

— 1 

— 

— 


3 

— 

— 

— 

— 

3 

104-B-~124-B 

— - 1 

— 

— 

— 

— 

2 ! 

2 

— 

— 

4 

1^4*B—144'B 

— i 

— 

— 

— 

— 

2 

— 

— 

— 

2 

144-B^164-B 

— i 

— 

— 


— 

— 

— 


— 

— 

164-B—184-B 

— 

— 

— 

— 

— 


, — 

f 

2 

2 

Totals 

22 

23 


5 

__!j 

_!j 

2 

. "j 

2 

75 


Afvnal* of Mathematical StatUtic$t Vol. i. p. 111. 
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On the Corrections for Brocbd CeUegoiries 

The peculiar nature of the distribution upon which Mr Wisniewski proposes to 
test my corrections will now be obvious to the reader. There is extreme abrupt- 
ness at the low end of the range for both variates, and the usual, or Sheppard’s 
corrections, are bound to fail. However, let us see how the results work out. We 
will take our working axes through 64-6, 54’6, and our working unit 20 houses. 
Summations being from working origin. We find 

Vx ^ ~ *32, * (y) = - *32. 3-573,333, * 8 (y») - 3 546,666, 

and 8 {xy) =* 3*62, and q^y =* 3*52 — *1024 = 3*4176. 

Hence, making no corrections, we have 

<r«* « 3-470,933, V = 3-444,267, == 3-4176, 

- 1-863,044, a-y - 1-865,874, = 3-467,576. 

Accordingly the crude uncorrected Vg^ is given by 

= qxyltr^Ty = -988,438. 

Now Sheppard has shown that if the sub-ranges are equal and sufficiently small 
as compared with 0 -^ and cTy*^ and there is high contact at both ends for continuous 
variates, then qcey requires no correction and (sub-range)* must be subtracted 
from and 0 *^*. I have shown that under like conditions my class index cor- 

rections rg,Cx reduce to 

the X and y sub-ranges respectively. These lead if the conditions stated above 
be supposed to hold to precisely the same result as Sheppard’s corrections. 

Using Sheppard, we find 

= 1*840,6433, <ry = 1*833,2850, ay = 3*374,240, 

and accordingly 

- 3*4176/3*374,240 = 1*012,85. 

Thus these corrections fail, and the reason for failure lies not so much in the fact 
that h is more than half < 7 ^, as in the absence of high contact at the start of the 
ranges of both x and y. Before we turn to the correction for abruptness, let us see 
what Carver’s formulae will provide. 

Now if there be u discrete values in the sub-range, Csuwer subtracts 



from the standard deviation squared, i.e. *0831,2500 instead of *0833,3333, like 
Sheppard. This gives us 

1*840,6999, cry « 1*833,3418, and <ra: 0 *y « 3*374,4487. 

Thus the corrected r*y = 3*4176/3*374,4487 *= 1*012,78. 




where h and k are 


This is certainly not fulfilled in the present data. 
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We see again that the correlation is greater than unity, and with twenty 
discrete values in the sub-range Ccurirer’s formula scarcely improves on Sheppard’s, 
which treats them as continuous. Thus the failure does not arise from treating 
discrete grouping as continuous. 

The source of error clearly lies in neglecting the abruptness. Allowance for 
abruptness has been made by Pairman and Pearson* and put into a convenient 
form by Sandonf, but possibly the best summary is that provided in Part II, 
p. cxciii of the Tables for Stcutistioiams and Biometricians. Assuming high contact 
at the end of the range and moments to be taken about the start, we have to 
correct the means and the standard deviations of the two variates by the formulae 


where 




Hi 




ll,153ni - 12,566nt + 1 0 .lTeti, - 4,586n« + 863nt 
60,480iV 

l,830iti — 4,358n, + 4 , 110 n 3 — l,962n4 + 380nc 


60,480-y 

Here »ii, n*, «*, n. and n« are the first five sub-frequencies from the abrupt 
terminal, and N is the total frequency = 75 in our case. Noting, that for ai, 
wi = 22, «* = 23, «• = 14, lit — 5, ns = 3, 


and that of y. ni = 21, na = 25, na = 13, n. = 6 , ns == 3, 
we find *jgr,. a=+ 0172,9960, «Ha,o = - 0024,4797, 

yHi,a= - 1 - 0070,5688, - - -0056,7901. 

Accordingly, - 2-6 - -32 - 0172,9960 =. 2-1627,0040, 

aMi' - 2-5 - -32 - 0070,5688 - 2-1729,4312. 

The corresponding values from the origin (2-5, 2-5) are 

- _ -3372,9960, - - -3270,5688, 

and accordingly the corrected 9 ,^ is given by 

qxy~‘^S(aiy) - X 3 62 - -1103,1615 = 3 4096,8385, 

Again 

(®») + 5 + 6-25 = 3-5733,3333 - 160 ■¥ 6 25 = 8-2233,3333, 

*Ma' - 8-2233,3333 - -0833,3333 - (- 0024,4797) = 81424, 4797, 

“ 8-1424,4797 - 4-6772,7302 - 3-4651,7495, 
and « 1*8614,8804. 

Again 

= Vt iS (y*) 5 + 6-25 - 3-5466,6667 - 160 + 6 25 - 81966,6667, 

aft,' = 8-1966,6667 - -0838,3333 -I- 0056,7901 = 81190,1234, 

•r* =* v/*i' - vMi'* * 8-1190,1234 - 4-7216,8180 = 3-3973,3054, 

<ra» 1-8431,8489. 


* BiometHkaa Vol. xn. pp. 288, 280. 
t Biometrikay Vol. xvi. pp. 198 — 196. 


and 
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Thus 


and 


= 3-4310,8506, 
_ 3-4096,8386 


cToia'f, 


3*4310,8606 


*993,763, 


and this differs from the true value *993,442 by *000,321, Even this might have 
been got rid of had a more elaborate attempt been made to determine the probable 
start of the marginal frequency distributions. 

Now let us examine what Mr Wisniewski has done. He begins by assuming 
that my class-marks C* and Cy are the midpoints of the sub-ranges. On the very 
first page of my memoir, I start by saying that I am going to take the mean of 
the group of individuals in the «th class as my class-mark. I then proceed later 
in the memoir to discuss the conditions under which it is possible to replace the 
summations involving Xg, yt by corresponding summations of the midpoints a?,, yt 
of the sub-ranges as class-marks. Mr Wisniewski pays no attention to these 
conditions, never enquiring whether they are satisfied or not, and thus reaches 
corrected values of the correlation which exceed unity, which they may easily do 
as the uncorrected correlation of his suitable case is already *988 and over. I can 
only think that he has not reacl, or if he has done, not understood the meaning 
of my memoir. On pp. 120 and 128 it is stated that the vanishing of certain 
summations depends upon high contact at the terminals or on the truth of Shep- 
pard *8 hypothesis. When this hypothesis does not hold, then, as is well known, 
corrections have to be made for ‘‘abruptness.” 

It is idle to say how unusable these formulae are if we take as an illustration 
a case expressly excluded by the author of them. But in these excluded cases, 
if proper corrections are made for the means and standard deviations, we find that 
the short table gives a result close to the true value obtained from the full data. 

Mr WiiSniewski never tells us what he means by a “class-mark.” I understand by 
it a term which is common to every individual in a broad category, and any quantity 
may be used as the class-mark which is constant for every member in the group. 
Thus the mean x of the group may be used, or the mid-point of the sub-range, or 
the median of the values of x in the sub-range, I start my discussion by saying I will 
take the first of these as convenient, but pass on in the course of the paper to consider 
the conditions under which we may use other values to represent the “class-mark.” 

I now come to the criticism of my formulae and the degree of their approxi- 
mation, using of course the value of for which they are established. 

In the first place, let us consider Formula (i) of the previous memoir 

(i). 

“as 


Let X and (« Xg) be measured from the mean of the variate x^ and let rig be 
the frequency in the ^th class, S a summation within the class, and S a summation 
for all classes. Then by definition 


_8(G^^Xg) _8(G^ngC^) 







O-x ' 
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The formula is thus absolutely true and not approximative at all, and Mr Wis- 
niewski’s remark "It is sure that Karl Pearson was aware of the approximative 
character of this formula (i) ” appears somewhat wide of the mark. 

Let us turn now to a second formula for the partial correlation of 

Mr WiSniewski gives a somewhat lengthy discussion of the meaning of a partial 
correlation coefficient*. I should interpret the relationship here with discrete x 
and y to be as follows : If we pick out for a given x all the corresponding y*a and 
then the product moment S (y — yai) (Oa?—- (C'fl>)ar*co«»t.) must vanish. This it clearly 
does. For Oaf is a constant when x is constant, and S (y) ngty a,, both factors of the 

product being independent for constant x. There is accordingly, in my mind, no 
doubt of the absolute truth of 

Ox “ 

In the same manner 

xv^OxCp 9 

is absolutely true, for the correlation of two constant quantities is zero. The 
question then arises as to how far we may replace these partial correlation co- 
efficients by the customary expansion formulae for linear regression. This is 
discussed at considerable length in my memoir, and involves the relationship 
between the three following summations 

^ {^^sV^y and 

where rigt is the frequency in the iigt cell, i.e. that common to the sth column and 
fth row, Xg, yt are the midpoints of the sth column and ^th row, and Xg, yt are the 
means of the zr-constants of the ^th column and of the y-constants of the ^th row, 
namely, they are what I term on the first page of my memoir Ox and Cy^ the class 
indices. 


On p. 123, the fundamental equation (xvii) is reached, namely 

.. Viyr <T^<ry) 

“ 

or, in our present notation. 


.(xvii), 




^Ox^Oy '^XOx'^yOy 


.(xvii lyis)^ 


which amounts to saying 

wv^'OxOy 9 

when that partial correlation coefficient is expressed in the ordinary linear formj. 


* As far as I can follow Mr Wisniewski, he is only indioating the difference long recognised by 
biometrioians between * singular ’ and * plural ’ partial correlations. 

t All quantities are for the remainder of this note to be measured from the means of the x and y 
variates. t See footnote p. 118 of my memoir. 

Biometrika xxvii 


24 
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But what are the expressly stated conditions under which (xvii his) will be a 
close approximation to the true result ? 

(i) The most important is high contact at both end terminals of the total 
ranges, i.e. cases in which the Sheppard corrections are adequate. This condition 
has been entirely forgotten or disregarded by Mr Wisniewski, and he chooses as a 
suitable example a case where there is abruptness of a high order at one terminal 
of both the x and y distributions I 

(ii) In deducing (xvii his) we have nowhere assumed that the regressions of (7* 
or Cy on X or y are linear. The regression of x on Cfn and of y on Cy are a series 
of points on 45® lines. The regressions of (7* and Cy on x and y respectively are 
staircases, the midpoints of the treads being on 46® lines, approaching more and 
more nearly to straight lines as the sub-ranges h and k are reduced. But these do 
not for the present concern us*. The only assumption of linearity made is that 
of a? on y (see top of p. 123 of my memoir). By the nature of things this must be 
closely fulfilled in Mr WiiSniewski's example. 

(iii) Is there any other assumption made in deducing (xviiftw)? Yes, one that 
is again not referred to by Mr WiiSniewi^i. (xvii his) is a very approximate formula, 
but it depends on fourth order terms being negligible as compared with second 
order terms. The term in question is 

676cra,<ry \JV n* / 

Evaluating this terra numerically for Mr Wifiniewski’s data, we find that it 
amounts approximately to *0005, or will only influence the fourth decimal place 
in the correlation coefficient, while the Sheppard corrections will modify the second 
decimal place. Accordingly we may consider our results to be sufficiently approxi- 
mate, if we neglect this fourth order term in this example as suggested on p. 122 
of the memoir. 


It follows that the reason for Mr WiiSniewski’s getting corrections which raise 
the corrected correlation above unity lies in his overlooking the condition (i) of 
formula (xvii his), namely, that there must be high contact at both 
terminals of the marginal totals. When the proper corrections for abruptness are 
made in the means and standard deviations, then the coefficient of correlation does 
not exceed unity, but on the contrary is close to the value obtained by the long 
process. 


It may be as well here to show the relation of (xvii) to the closely approximate 
formula 

(xviii). 


* EIxoept in bo far as the first set provides another easy proof of (i). We have : value of x for 

a given but it also equals since regression is linear C?,. Accordingly r^cm “w®* equal 

/ ^Ox 
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Both formulae follow at onoe, if we aasume linear regressions to hold for all the 
variables a?, y, (7», Cy, But these not being all linear, my memoir discusses the 
hypotheses under which they will hold with a very considerable degree of approxi- 
mation, namely, (i) high contact, (ii) the regressions of oc and y linear, and (iii) the 
product and squares of ^ negligible as compared with those 

quantities themselves. Under these conditions, (xviii) will now be shown to be an 
approximate as (xvii his). 

Starting from (xvii) and using the approximate expressions and yt in terms 
of Xf and yt on p. 119 of my memoir, we can at once write it in the form 


* »y 








1 + 












.1 * o r, - Wt-xM 

--is; — JJ 


' + 1L7) [’ 


.1 ^ a 1 

24 ^ — MV\j 


rit+i - ??t~i 

Tit 


24 (TogO-y V Tit ) 


hk 


67 6 o'Ox^v 


S 


(ngt — n,«.i rtt^x — 

\N n, ~ fit ' J]’ 


In the second summation we may replace ac^ by o** for 
and the may be neglected in a turn of the fourth order. Accordingly the 

third summation is negligible, and even in Mr Wisniewski's case only contributes 
roughly *0005 to the right-hand side of the equation. 


Now consider the first summation, keeping t constant or sticking to a definite y, 
« mean x for a given yt = but if x and y have linear regression, then 


nt 


and the first summation becomes 


yu 


“ (^*-1 ^t+i) 

IT ** V 


X* A / 1 


If there be high contact at the terminals, and in this case only, the first sum- 
mation vanishes, and the second and third equal unity. Accordingly, neglecting as 
before terms of the fourth order of small quantities, the first summation becomes 


which is equal to 


12 57 <ra, V 12aJ* 12 a-/ 


24—2 
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to our degree of approximation. Thus 


or 

or, finally, 


/t . A;* \ P cTc^ 

0 -* v'^ 1 2«ry* cr, 

(l + = (l + 12^) 


^* 1 / ^ 


which are to be proved. In other words, with our assumptions we get the two 
relations 

'^yCx^ Ox ^ '^Cx Oyl (X^tOx'^v Oy) 

as close approximations; these relations are those of linear regression singular 
partial correlation coefficients. 

The conditions of these results holding are stated in my original memoir, the 
most important one being that of high terminal contacts, which has been entirely 
overlooked by Mr WiSniewski. It has long been known that with very high cor- 
relations VfxCx ^yCy occasionally over coiTect. But in almost every case 
where they do so, it will be seen, or it may be suspected, that there is abruptness 
at some one or other terminal which must be allowed for. We have shown how this 
may be done in the example selected as an illustration of failure by Mr Wisniewski. 
I have, however, to thank him for his paper, because it has enabled me to em- 
phasise again the conditions under which my formulae are approximations, and to 
indicate those formulae which are absolutely true. It seems to mo on re-reading 
my memoir that I have clearly therein stated these points, but if Mr Wisniewski 
has not grasped them, others may likewise fail to do so. I should welcome any 
alternative means of correction for broad categories when high contact fails, but 
more than twenty years’ experience has shown me that the class index corrections 
give very reasonable results when high contact limits the frequency distribution. 
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Malar Bonos, (xv) Nasal Aperture, (xvi) Asymmetry, (xvii) Jrregularity and Maloc- 
clusion of the Teeth, pp. 379 — 382. 14. Brief Description and Discussion of the 

Principal Cranial Characters: (i) Cranial Capacity, (ii) Length -breadth (cephalic) Index 
(100 R/A), (iii) Length -height (altitudinal) Index (lOO/f'/A), (iv) Height-breadth Index 
{\00 B/ (v) Upper Facial Index (100 0*HjGB\ (vi) Cranial Typos in the Series, 

(vii) Prognathism, p. 385. 15. Mandibltss : Material available for Study, p. 385. 

APPENDIX TO PART I. 

I. Individual Measurements of the Male Anglo-Saxon Crania from Bidford-on-Avon, 
with Remarks. 

II. Individual Measurements of the Female Anglo-Saxon Crania from Bidford-on-Avon, 
with Remarks. 

III. Individual Measurements of the Male Anglo-Saxon Mandibles from Bidford-on- 
Avon, 

IV. Individual Measurements of the Female Anglo-Saxon Mandibles from Bidford- 
on-Avon. 

* By the courtesy of the Bidford Co-operative Society the skeletal remains are permanently pre- 
served in the Museums of Human and Dental Anatomy in the University of Birmingham. 
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1, Introduction. The bones which are the subject of this communication were 
first brought to light in June 1921 during the cutting of a new road at Bidford- 
on-Avon, Warwickshire. In the course of this work a number of skeletons were 
disinterred along with various articles which indicated the site of an Anglo-Saxon 
Cemetery. Subsequently it was revealed that the cutting for the road traversed 
an extensive cemetery, but in 1921 the circumstances were such that no accurate 
note was preserved of the position of the various graves. The small number of 
bones available as the result of this initial discovery were merely collected together, 
many broken in the course of the digging, and, since the relation of the limb and 
other bones to the skulls was not determined, the exact number of graves opened 
during 1921 cannot be accurately known. 

Arrangements were made, however, for a controlled excavation of the site, 
which was carried out during the following summers of 1922 and 1923. Details of 
the methods and course of the excavations, with a discussion of the site, its relation 
to other Anglo-Saxon cemeteries and of the evidence which suggests that the 
Bidford Cemetery is to be assigned to the early part of the sixth century, are to 
be found in two publications in Archaeologia* by Mr John Humphreys and his 
collaborators in the work of excavation. The first of these papers includes a map 
of Bidford showing the exact site of the cemetery which is thus described ; “ The 
excavation took place on a site at the back of the High Street, on the north side, 
in a gravel plateau about 100 yards from the present bank of the river Avon, 
150 yards from the Roman Rycknild Street, and 200 yards from the old ford, 
which lies to the east of the church. The site is on the extreme western border of 
Warwickshire, within a mile of the Worcestershire boundary.^* There are also 
plans of the cemetery, a table giving details of the individual graves and of 
the grave furniture found with each interment, and three excellent photographs 
of skeletons and grave furniture in situ. The papers in Archaeologia are mainly 
occupied with a detailed and splendidly illustrated description of the archaeological 
material provided in abundance by the excavations ; and we cannot do better, for 
the purpose of linking the archaeological to the anthropological side of the discovery, 
than quote the paragraph in which the authors summed up the results of their 
detailed work. 

“ The evidence brought to light at Bidford, judging by the size of the graveyard 
already explored, the number of bodies, and the wealth and variety of the grave 
furniture, suggests a much earlier settlement in the heart of the Midlands than 
has generally been admitted, and the great number of cremation burials does not 
agree with the theory that the proportion of cremation burials decreases the 
further the settlements advanced up the valleys of the Thames and its tributaries. 
The materials show so strong a resemblance to those found in the Anglo-Saxon 
settlements in the Thames valley and its head -waters, that we are forced to 
conclude that the West Saxons, after penetrating the Cotswolds, descended into 

* “ An Anglo-Saxon Cemetery at Bidford-on-Avon, Warwickshire,*^* by Humphreys, J., Byland, 
J. W., Barnard, E. A. B., Wellstood, F. C., and Barnett, T. G. Archaeologia, 1024, ucxiii. 89—116. 

** Second Beport on the Excavations.’* Idem, Ibid. 1925, lxxiv. 271-268. 
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the Avon valley, and established themselves on the north bank of the river Avon 
at Bidford quite early in the sixth century.’* 

** Notes on the Cranial and other Skeletal Characters” have already been 
published as Appendices to the general accounts in Arohaeologia^ and the purpose 
of the present paper is to provide data and measurements which may be of use to 
those who may in the future essay the task of surveying all Anglo-Saxon remains 
discovered in England and coordinating the scattered evidence of the physical 
features and interrelations of the various groups in different parts of the country 
Acknowledgments of support by the Society of Antiquarians, the Birmingham 
Archaeological Society, and the University of Birmingham, and also to those who 
personally assisted the work, by giving help at Bidford, by contributions to the 
Excavation Fund, or by assistance in archaeological matters, have already been 
made in detail in the papers in Archaeologia 

2. Number of Interments, A perusal of the two papers in Archaeologia makes 
it obvious that the Bidford Cemetery, from the archaeological point of view, must 
take an important place in the list of Anglo-Saxon discoveries. A preliminary 
statement of the numbers of skeletons which it has yielded will show that it is 
equally important in its anthropological aspect. In the table that follows, the 
probable age and sex distribution of the better-preserved remains are indicated ; 
the number of interments represented by fragmentary bones only and the number 
of urns found are also stated. 


Bidford-on-Avon (1921-22-23). 



Age 

6 

9 

Sex? 

Totals 

r 

Preserved 

Bones < 

1 Fragmentary 

tr ragmontary 

6 

12 

20 

“if” 

«01d»^ 

( “Young” 

[ “Adult” 

6 

32 

16 

3 

7 

3 

20 

15 

5 

13 

5 

25 

21 

3 

20 

9 

2 

12 

10 

26 

120 + 

25 

21 

12 

72 

40 

10 

12 

30 

31 
120 


Totals 

64 

61 

248 + 

373 


“J/ ” = Mature. 


* Ck>ntour traoixigs of six (6) of the Bidford skulls have already been published by Professor F. G. 
Parsons, ** Anglo-Saxon Skull Contours,*’ Royal Anthrop, Institute^ Occasiotial Papers^ No. 9, 1928. 
Tracings of twelve (12) others are available in the Library of the Department of Anatomy, University of 
Birmingham. All these are indicated in the table of individual measurements. Tables of measure- 
ments of the limb bones are also available in the Department of Anatomy, University of Birmingham. 
[The contour tracings referred to above are of no service for comparison with the long series of contours 
published in Biometrika, as the farmer are dioptographic, and so are not true contours, i.e. plane sections 
of the skull. Ed.] 
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The total number of interments uncovered during the excavations of 1922 and 
1923, as determined by the discovery of well-preserved skeletons, of fragmentary 
bones, and of urns, was 229. The grand total of 373 is reached by adding 24 for 
the number of interments represented by the bones which were collected in 1921*, 
during the cutting of the road which led to the discovery of the cemetery, and 
120 for the minimum number of urns discovered in fragmentary condition through- 
out the period of the excavations. The number assigned to skeletons discovered in 
1921 is also a minimum figure, since no record of the position of each skeleton 
was made and the bones were mixed ; so that it has not been possible to assemble 
the skeletons with any certainty. The number is in fact derived from the minimum 
number of individuals determined from the skulls and mandibles alone, and agrees 
fairly well with the extent of the area excavated and the average spacing of inter- 
ments determined in the controlled area in 1922 — 23. The total number of 
interments excavated in 1922 — 23, which yielded adult bones in a reasonably 
fit state for accurate observations and measurements, was about 45. 

3. Sexing of Bones. The sexing of the bones has presented the usual difficulty, 
from the purely osteological point of view, of the residuum of doubtful specimens. 
Where whole skeletons have been preserved the pelvic bones have been taken as 
the most trustworthy indicators, and the information thus obtained of the range 
of variation of the other bones in skeletons of practically certain sex has been 
used to guide the decision where pelvic bones were not available. It has indeed 
been found that secondary sex characters are very well marked. A further and 
valuable check has been utilised in the character of the grave furniture where 
recorded, and this evidence alone has been taken for the purpose of the table in 
a number of instances. In no instance has there been serious conflict between the 
archaeological data and the osteological indications ; the proportion of the sexes 
shown in the table may therefore be taken as approximately correct. A number 
which remain doubtful are indicated as such in the tables, but a sex diagnosis 
has been supplied for the doubtful skulls of which measurements could be taken. 

4. Age Distribution. The system that has been adopted is to place the adults 
in groups according to indications supplied principally by the wearing of the teeth 
and the state of the cranial sutures, and to group the younger skeletons around 
the ages 6, 12 and 20, according to the state of the eruption of the teeth and the 
condition of the epiphysial junctions. Beyond a certain point the determination 
of age must naturally be approximate only. The eruption of the teeth and the 
condition of the epiphyses are trustworthy guides up to maturity, but beyond that 
point only a rough grouping is possible. It is in any case probable that, on the 
average, age changes would be earlier and more pronounced than in the skeletons 
of modern people. The largest number of skeletons is found in the “ Mature ” 
class — fully adult with moderate wearing of the teeth and still open cranial 
sutures. The “ Mature + class includes those that show commencing signs of 

* The Bkulls discovered in 1921 are designated by letters in the Appendix Tables; those found in 1922 
and 1923 by numbers. 
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ageing — ^such as the closure of the cranial sutures associated with increased wear 
and loss of teeth ; and those in which these signs are unequivocal are placed in 
the Old ” clauss, which includes a fair proportion of the whole. 

The youngest skeleton is that of an infant of about nine months. It is not 
possible to say what the extreme range of age may have been, but we may assume 
that most of the “ Mature ” and the “ Mature -f ” classes would be between 25 and 
40. Although the majority fall into these two classes, it is obvious that there is 
a considerable proportion of young persons and an appreciable number of old. 

Apart from all other evidence, the conclusion that is inevitable, from a con- 
sideration of the total number of individuals represented and their age and sex 
distribution, is that we are dealing here with the burial place of a community. 
But in the absence of evidence of the length of time during which the cemetery 
was used and of knowledge of the probable death-rate, it is not possible to estimate 
and hazardous to speculate about the size of the community of which it was 
undoubtedly the burial place. 

6. The Mode of Burial and the Depth of the Graves, In most of the burials 
it was possible to determine the position in which the body had been interred. 
In the majority the grave had been made sufficiently long to allow the body to be 
placed in it at full length, but in some the skull was found to be pressed down 
upon the trunk and the foot bones lay alongside the bones of the leg as if the 
body had been pressed into a grave too short for it. There appears to have been 
no exception to the rule that the bodies were buried lying on the back, sometimes 
with an inclination to the left side. The skull was usually found inclined to the 
left, though in not a few it was turned to the right. In a considerable number 
the legs had been crossed at the ankles, generally the right over the left ; and 
in the male burials certainly the position of the arms was fairly constant. The 
right arm was found to be bent at the elbow, the hand usually folded over the 
chest, but in some skeletons resting on the right shoiilder with the elbow fully 
Hexed. The left arm, in contrast, was usually found extended by the side, the hand 
in some instances resting on the left thigh or on the lower part of the trunk. 
The different attitude of the two arms is to be explained by the presence of the 
spear on the right side, the right arm being naturally folded over it to retain it in 
position. It is even probable, when the right hand was found to be in the vicinity 
of the right shoulder, that the spear had actually been placed in the grasp of the 
hand. {Archa^ologia, 1924, Figs. 2 and 3.) 

The average depth of burial was found to be about 3 ft., the minimum being 
2 ft. and the maximum 6 ft. 

6. Orientation of the Graves, Full details of the orientation of the interments 
are to be found in the Archasologia publications and in the maps accompanying 
them. Here it need only be mentioned that some 70 % of the bodies were found 
with their feet pointing between N.W. and N.E., and that the direction between 
S. and N.W. appeared to have been carefully avoided. In the part of the cemetery 
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that is judged to be of later date, there appears to have been a distinct tendency 
to the E. and W . position ; and there is no doubt that this orientation is associated 
with a comparative lack, and, in some instances, a complete absence of grave 
furniture. 

7. Grave Furniture. The spear head, the shield umbo, and the knife were 
characteristic of the male interments ; and the women had been buried with their 
necklaces of amber and glass beads, and jewelled brooches. Occasionally a small 
pot was found close to the head of a female skeleton on the right side, and usually 
there was a small knife on the breast or at the side. 

It is an interesting fact that the grave furniture was found to be much less 
plentiful in the later stages of the excavations ; and this is believed to represent 
the later, possibly post-Christian, portion of the cemetery. 

8. Cinerary Urns. The association of a considerable number of urn burials 
with ordinary interments is a feature usual in cemeteries of this date, and of 
very great interest and significance. It has often been stated that um burial 
is an Anglian, inhumation a Saxon, characteristic. The proportion of urns might 
therefore be held to indicate the degree of Anglian admixture ; but in all West 
Saxon cemeteries that have been investigated, some urn-burials have been found. 
At Bid ford thirty-one more or less perfect cinerary urns were found. Like the 
interments, they were not arranged in any definite order, but were found scattered 
about irregularly among the skeletons. The urns were found at an average depth 
of 2 ft., and the fact that fragments of at least 120 were found, in addition to the 
smaller number of more or less perfect specimens, is taken to indicate that the urn- 
burials date from an earlier period than the inhumations. It is conjectured that 
the majority of the urns were then broken up in making the subsequent graves. 
The evidence, however, is not quite conclusive that the two methods were not in 
simultaneous use. Full detailed descriptions and illustrations of the cinerary urns, 
all of which were hand -made, are to be found in the publications in Archaeologia, 

9. Associated Animal Remains. The majority of the animal bones recovered 
during the excavations were clearly of modern origin, but it is possible that a few 
of the bones of pigs, horses, sheep and oxen may be contemporary with the human 
remains. 

10. The State of Preservation of the Bones. The condition of the bones is very 
uneven. Many of them are in excellent condition, though porcelain-like and 
very brittle. Others are very soft and friable, with flaking and crumbling of the 
surface. Many of these have had to be treated with balsam varnish to prevent 
them falling to pieces. Portions of the same skull or long bone are also frequently 
found to exhibit these extreme conditions of preservation. The excavators of the 
site attribute these facts to the varying nature of the subsoil. The general good 
preservation is thought to be due to the excellent drainage provided by the gravel 
deposit, 20 ft. in thickness, upon which the bodies rested. The accumulation of 
stagnant water was prevented, and the calcareous nature of the gravels is held to 
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have neutralised acidity from the atmosphere. Scattered throughout were irregular 
pockets of boulder clay and it is believed that the sofbening of many of the skulls 
and other bones was due to contact with these clay deposits, decaloification having 
been brought about by the action of sour stagnant water unable to drain away 
from the surface of the boulder clay. 

The number of skulls and other bones available for accurate study, on account 
of the bad preservation of many of them, consequently falls far below the number 
of interments. When, in addition to this factor, there is taken into account the 
number of urn burials, it is found that from a cemetery that is estimated with 
some degree of accuracy to have contained the remains of at least 373 individuals 
there are available for the determination of their anthropological characters the 
skulls and other bones of some 70 individuals only. 

The total number of skulls available for measurement, many of them incomplete 
in different respects, is 51, of which 82 are male and 19 female. 

11. Sea; Characters. There is no doubt that secondary sex characters are very 
well marked in the skulls. There are very obvious differences in the size of the 
mastoids and in the muscular impressions in the occipital region ; and the super- 
ciliary eminences are very extensively developed in a number of male skulls. In 
a few there is even a filling up of the usual triangular depression between the 
superciliary eminence and the zygomatic process of the fiontal bone. This is 
noticeably the case in No. 107 ; and in No. 36, an old male skull, much distorted, 
there is even the appearance of a moderate torus supraorbitalis. 

12. A^e Changes. There is little of special note to record with regard to the 
age changes in the skulls. It is probable, though the evidence is by no means 
decisive, that closure of the sutures commenced at a comparatively early age. The 
first sites, as a rule, appear to be the usual ones at the lower end of the coronal 
suture with the pterion region generally, and the hinder part of the sagittal suture, 
though there are some exceptions. Arthritic changes in the region of the condyles 
and in the glenoid fossae are almost the rule in those skulls with extensive closure 
of the sutures. 

13. Remarks on Cranial Anomalies, (i) Suture Patterns and Sutured Bones. 
The patterns of the sutures in general are of average complexity with the usual 
scattered sutural (Wormian) bones in the lambdoid suture. 

(ii) Metopism. There are three examples of definite metopism in the series, 
Nos. 17, 63, 82. In two of these the suture is closed internally but leaving definite 
evidence of its presence ; and in both of them there are also indications of com- 
mencing external union. In both, these signs are distinctly in conformity with the 
age of the skulls, and union is less advanced than in the other sutures. 

(iii) Metopic Ridge. Two of the skulls. Nos. A and 106, have slight metopic 
ridges and in one (D) there is a marked metopic boss occuppng the position of the 
glabella. 
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(iv) Pterion, There is only one example of squamoso-frontal suture in the 
whole series — on the left side of skull B — in which the junction measures 16 mm. 
There are several examples of epipteric ossicles ; single right in No. 92, joined to 
the sphenoid: double right in No. C; double left in No. 60 ii ; and almost 
symmetrical single on both sides in No. 37. 

(v) Interparietal, There are sutural bones of fairly large size in the region of 
the lambda in a number of skulls ; a large one occupying the lambda in No. 93, 
a child ; three symmetrically placed in No. 107 ; three on the right, one of which 
is sagittal, in No. D; four in No. 50 i; and a series in No. 82. In only one skull 
(D) are these ossicles large enough to be interpreted as portions of the interparietal 
bone. Two large ossicles occupy the upper half of the right portion of the squamous 
occipital and one of them extends across the middle lino, being placed im- 
mediately behind the sagittal suture. There are in addition two imperfectly closed 
sutures in the occipital bone running on each side upwards and towards the middle 
line from below the astorion. These indicate the separation of the interparietal 
from the supraoccipital portion of the squamous occipital, and the whole is an 
example of ossification of the interparietal from several centres. 

(vi) Bregmatic Ossicles, There are two fine examples of bregmatic ossicle 
(os antiepilepticum). In No. 5, a male, the ossicle lies entirely behind the coronal 
suture, measures 27 mm. in antero-posterior length by 15 rnm. in maximum breadth, 
and is slightly constricted in the middle. In No. 89, female, the ossicle indents 
the line of the coronal suture but lies almost entirely behind it. It measures 
19 mm. in antero-posterior length by 14 mm. in breadth. In neither of these 
examples is there any sign of the union of the ossicle. In No. 82, a me topic skull, 
the interfrontal suture joins the coronal to the right of the sagittal and there is 
evidence of the former presence of an anterior bregmatic ossicle which has united 
to the left half of the frontal bone. The association of metopism with the develop- 
ment of a bregmatic ossicle is noteworthy. 

(vii) Asterionic Ossicles. There are also two examples of largo asterionic 
ossicles, and it is probably more than a coincidence that these occur in the same 
two skulls as the bregmatic ossicles — Nos. 5 and 89. They are both on the left 
side and show signs of uniting with the surrounding bones. 

(viii) Ohelionic Depression, There are five examples of a groove along the 
sagittal suture in the region of the obelion. In three of these (Nos. 40, 75 i, and 
78) the groove is well marked, in one of them (No. 75 i) it extends to within 2 cm. 
of the bregma and is also continued on the occipital bone as a slight depression 
from lambda to inion. The other two examples (Nos. 46 and 66) are slight, and in 
No. 46 it is associated with symmetrical depressions extending on each side along 
the lambdoid suture. One skull (No. C) exhibits a fairly obvious sagittal crest. 

(ix) Post-coronal Sulcus, There is no definite example of this in the whole 


series. 



J. O. Brash and M. Young 


381 


(x) Coronal Ossicle, There is one example of ossicle in the coronal suture, on 
the right side just below the temporal line ; this occurs in No. 82, already noted 
as a me topic skull with an anterior bregmatic ossicle. 

(xi) Oarotico-Clinoid Foramen, In No. 37, owing to damage to the skull, it 
may be observed that there is a complete carotico-clinoid foramen on both sides. 
It may be noted that this is associated with symmetrical epipteric ossicles. 

(xii) Par occipital and Paramastoid Processes, There is a paroccipital process 
in No. C and a nipple>like paramastoid process in No. D, both on the right side. 

(xiii) Ossified Trochlea, The trochlea is ossified in both orbits of No. 60, a 
skull which does not show any definite signs of ageing. 

(xiv) Divided Malar Bones, There is no example of divided malar in the series, 
but it may be rioted that both the malars are transversely grooved in No. A. 

(xv) Nasal Aperture. Some notes on the form of the lower border of the 
anterior (pyriform) aperture of the nose will be found in the remarks appended to 
the Table of Measurements. It is worthy of mention here that a complete sharp 
limiting margin is uncommon in these skulls, that prenasal fossae are present in 
several, and that in one (No. E) there are infranasal grooves (prenasal sulci) of 
anthropoid type. 

(xvi) Asymmetry, As is the case in any collection of skulls, there is a con- 
siderable amount of asymmetry to be observed, especially in the region of the base. 
It is probable that the immediate causes of asymmetry are to be found not only 
in the well-known asymmetrical development of the occipital lobes of the cerebrum, 
but also in individual habits in the “wear of the head*' and in the use of the jaws. 
The correlation of the different points in which a skull may be asymmetrical would 
be a very interesting study ; one or two points as shown in this series may be indi- 
cated. The occipital region has been noted as obviously asymmetrical in 7 skulls ; 
the left is more projecting in 6 of these, and the right in only 1. In 5 of the 
6 skulls with left occipital prominence it is also noted that the right mastoid 
process is larger than the left. Taking the skulls as a whole the mastoids are 
noted as asymmetrical in 18, and of these the right is larger in 10, the left in 8. 
The occipital condyles are noted as lisymmetrical in 6 skulls, the right condyle 
being anterior to the left in 2, the left anterior to the right in 4. There is, 
however, no obvious association of position of the condyle with either occipital 
prominence or larger mastoid. The glenoid cavities have been noted as asym- 
metrical in 13 skulls and of these the right is deeper than the left in 5, the left 
deeper than the right in 8. Again there is no obvious relation between these facts 
and the other asymmetrical points noticed. The last point in which asymmetry 
has been noted is in the size of the nasal bones ; this is definite in 8 skulls and 
in all but one of these the right bone is the larger and passes over the median 
plane at its frontal articulation. It may be surmised that this condition is correlated 
with forward projection of the frontal bone occurring in skulls with a well-marked 
left occipital projection ; but as a matter of fiict in this series asymmetry of the 
frontal bone itself is not at all obvious, and the asymmetrical nasal bones occur. 



382 Anglo-Saxon Shulls from JBidford-on-Avon 

with two exceptions, in skulls which do not show left occipital projection, and 
these two exceptions include the only example of a larger left nasal bone. 

(xvii) Irregularity and Malocclusion of the Teeth, It should be recorded here 
that the Bidford series has been the subject of a special investigation of the 
incidence of irregularities of the teeth and of malocclusion. These conditions, in 
mild or severe degree, have been found in a surprising number of the skulls; details 
and discussion will be found in “The Aetiology of Irregularity and Malocclusion 
of the Teeth” (Dental Board of the U.K. Lectures, J. C. Brash, 1929) and “Some 
Notes on the Dentitions of Anglo-Saxon Skulls from Bidford-on-Avon with 
special reference to Malocclusion ” {Trans, B.S.S,0.^ K. C. Smyth, 1933). 

14. Brief Description and Discussimi of the Principal Cranial Characters, 
The main object of Part I of this paper is to place on record accurate data 
concerning the cranial and mandibular characters of these Bidford Saxons. The 
principal measurements of the individual skulls that were sufficiently well preserved 
for taking at least a number of the measurements included in the scheme of 
measurement usually followed at the Biometric Laboratory, University College, 
London, and in accordance with the technique practised there are tabulated in the 
Appendix with brief notes on any distinctive or peculiar features of particular skulls 
that seem worthy of mention. 

Tables of the average values of these characters for the separate sexes with the 
numbers of observations on which the averages are based are given in the text on 
pp. 390 — 391. As the numbers of skulls on which the means are based rarely exceed 
and are usually considerably below 20, the standard deviations — the measures of 
variability — of the characters have not been considered sufficiently representative 
or reliable for tabulation. 

A selected series of the cranial characters will be considered individually 
and in the aggregate later, when a comparison is made between them and the 
corresponding characters in the groups of Anglo-Saxon skulls from Burwell in 
Cambridgeshire and from the London Museums, but some of the more general 
features of the skulls as a group will be indicated here by reference to a few of the 
principal cranial characters. 

(i) Cranial Capacity, In only 11 of the skulls, 6 male and 5 female, did the 
condition of the specimens warrant the risk of estimating the cubic capacity by 
the direct method, No. 8 shot having been used as a medium. These are entered in 
the table of measurements of individual skulls. For the 6 male skulls the average 
cubic capacity was 1563 c.c. and for the 6 female skulls 1378 c.c. The mean 
capacities for male and female skulls in the series were also estimated from the 
product of the serial means of the three dimensions, greatest length (A), maximum 
breadth {B) and basio-bregmatic height {H*) by using Miss Hooke’s formulae*, 
namely : 

cT C (in c.c.) = *000,366X151/' -f 198*87, 

? C (in C.C.) « 000,366 XBH' + 199*43. 

* Biometrika, Vol. xviii. (1926), p. 38. 
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The values thus obtained, 1595*7 for the male and 1411 c.c. for the female, are 
rather in excess of the mean capacities obtained from the smaller numbers of 
specimens hy the direct method*. 

(ii) Cephalic Index (100 BfL), Omitting one or two skulls which were obviously 
distorted, the cephalic indices were obtained for 29 skulls, of which 10 were female. 
In the males the index ranges from 69*5 to 77*8 with a mean value of 73*6 ; in the 
females the range is from 68*4 to 76*9 with a mean of 73*8. The distribution of 
the skulls according to type is as follows : 


Class 


Dolichocephalic 
Mesati cephalic 
Brachycephalic 


Range 

Numbers 

Per cent. 

6 

9 

6 9 

d and 9 

-75 

13 

6 

19 

66*5 

/75— 77-4 

3 

4 

7 

24*1 

\ 77 . 5 — 79*9 

3 

0 

3 

10*3 

80 + 

0 

0 

0 

0 


Nineteen of the skulls, 66 %, fall below the 76 mark and are therefore to be 
reckoned definitely dolichocephalic ; the remainder fall within the mesati cephalic 
class, not a single brachy cephalic skull being included in the series. It is also of 
interest to note that on a division of the mesaticephalic class into two subclasses 
at 77*6, as suggested by Sir William Turner, in order to indicate the afiSnities in 
the intermediate group, the proportion that may be considered to show a distinctly 
long-headed tendency is almost 90.% of the total number. We are therefore clearly 
dealing with a sample of a strongly dolichocephalic community. 

Length-height Index (lOOH* jL), The classification of the skulls according to 
this index is follows : 


Class 

Range 

Numbers 

Per cent. 

6 

9 

S and 9 

^ and 9 

Chamaecephalic 

-70 

0 

2 

2 

11*1 

Orthocephalic 

70—75 

10 

2 

12 

66*7 

Hypsioephalic 

75 + 

1 

3 

4 

22-2 


* [It should be noted here that Miss Hooke’s formulae are based on the Biometric Laboratory 
method, which inyolves the use of mustard seed. The results obtained by the use of shot are not 
comparable with those obtained by mustard seed, witness the exaggerated values obtained by Broca’s use 
of shot. Nor is it legitimate to put into the product term in Miss Hooke's formulae the product of the 
serial means, for the product of the means is not the mean product. Ez>.] 
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From this point of view the skulls are predominantly of the orthocephalic or inter- 
mediate type. In the males the length-height index ranges from 70*0 to 76*2 with 
a mean value of 71*8, and in the females from 66*8 to 76*9 with a mean value 
of 72*4. 

Height-breadth Index For this index we have the following distri- 

bution : 





Numbern 

I Per cent- 

ClaRs 

Range 

6 

9 

^ and 9 

6 9 

Hypsistenocephalic 

-100 


2 


29*4 

P laty ch amaeceph al i c 

1(M) + 

8 

4 

12 

70-6 


Th(i majority thus fall into the platycharnaecephalic class. In the males the 
index ranges from 98*5 to 109*3 with a mean value of 102*2; in the females the 
range is from 96*3 to 112*2 with a mean value of 103*5. 

The impression conveyed from a study of the foregoing cranial indices is that 
we are dealing with skulls in which on the whole a low mean cephalic index, i.e. 
dolichocephaly, is associated with a definite preponderance of breadth over height. 

It may be recalled here that this combination of features was pointed out by 
Sir Wm Turner as characteristic of his series of Scottish skulls*. He then re- 
marked: “A striking feature of the Scottish crania, therefore, was the preponderance 
of the cephalic index over the vertical index, notwithstanding the considerable number 
of dolichocephalic skulls in the series, and in this respect the crania favoured the 
brachycephalic rather than the dolichocephalic type. The Scottish skulls are platy- 
chamaecephalic.” 

Upper Facial Index (100 O^HjGB). According to the value of this index, the 
skulls may be distributed as follows : 



The majority of the specimens thus show a relatively broad face. In the males the 
index varies from 66*3 to 80*3 with a mean value of 72*4; in the females the range 
is from 64*4 to 77*1 with an average of 71*5. 


Trails. Roy. Soc, Edin.^ Vol. xl. Part IH. 
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Normae faciales of Anglo-Saxon Skulls from Bidford-on-Avon. 
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Plate III 



Normae yerticales of Anglo-Saxon Skulls from Bidford-on-Avon. 




Normae oceipitales of Anglo-Saxon Skulls from Bidford-on-Avon. 
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Prognathism, Inspection of the skulls after orientation in the Frankfort plane 
conveyed the definite impression that some of the females, if not the majority, 
were relatively prognathic as compared with the males. As the mean value of the 
profile angle (Z. P) in the males in which this measurement was taken was 85*2^, 
whereas in the females it was 83*1^, or 2 degrees less, there would appear to be a 
reasonable basis for the belief. Further reference will be made to this feature later 
(pp. 396—396). 

Cranial Types in the Series. In the brief notes on the cranial characters appended 
to the first publication in Archaeologia, it was suggested that the distinction to be 
drawn between the definitely dolichocephalic skulls and those with a mesatice> 
phalic index was associated with an obvious and striking difference in the appearance 
of the skulls in norma vertioalis, and it was there stated that “there are found to 
be marked differences in the amount of parietal bulge, so striking in the well- 
characterised males as to lead to an initial distinction of two types best described 
as ‘ovoid’ and ‘ellipsoid’.” With the greater number of skulls now available, it is 
clear that while the sharpness of this distinction has been modified by the addition 
of intermediate forms, yet it is a distinction which stands confirmed. The two 
skulls, Nos. A and 5, which have been chosen for reproduction in four normae, are 
perhaps the best examples in the series of these two types which have been 
designated “ovoid” and “ellipsoid,” the ovoid being relatively broad and low, and 
the ellipsoid relatively high and narrow. While this differentiation into two types 
is mainly based on their relatively great difference in parietal breadth, which can 
be clearly observed not only in norma verticalis but also in the corresponding 
facial and occipital normae, the two types differ in other features notably in the 
characters of the facial skeleton. Though there seems to be, from the cranioscopic 
point of view, a reasonable basis for the establishment of a valid distinction between 
types within the series, it should be noted that the variabilities shown by the 
characters A (o- * 5*43), B {a ^ 6*97), 100 BjL (cr « 2-74) and 100 OH' /QB (tr - 4 84) 
in the male group are not greater than those shown in the. corresponding characters 
in the London Museums series of Anglo-Saxons or the Whitechapel seventeenth- 
century Londoners, both of which have been considered fairly homogeneous groups 
of skulls. 

The larger question of the bearing of these observations upon the problem of 
the origin of these Bidford-on-Avon people can be adequately discussed only by 
someone who may review together all the recorded data on Anglo-Saxon remains. 

Mandibles, After a careful preliminary survey of the mandibles that had been 
excavated, it was concluded that the total number of specimens represented in 
whole or in part was probably about 86. Some of these were not associated with 
crania. 37 of this total were fairly complete and adult or aged. Of these 20 were 
identified as male, 13 as female, and 4 of aged individuals were considered of 
doubtful sex. 17 of the total were less complete. Of these 10 were male, 6 female 
and in 1 the sex was doubtful. 23 adult specimens were only represented by 
firagments, and 9 more or less complete were obviously juvenile. 

Biometrika xxvii 
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When it was decided to take detailed measurements of the adult mandibles, 
50 specimens, 31 of which were identified as ^ and 19 as were found to be 
sufficiently complete to provide a number at least, if not the majority, of the usual 
measurements. 

These mandibles were measured in accordance with the scheme of measure- 
ment now followed at the Biometric Laboratory, University College, London. As 
compared with the original scheme which is described in detail in Biometrika, 
VoL XIV. pp. 253 — 260, the number of characters of which measurements are taken 
is considerably reduced. We are indebted to Dr G. M. Morant for the abbreviated 
list of measurements now usually taken. The measurements of the individual 
mandibles are given in detail in the Appendix (Tables III and IV). The mean 
values of the characters measured and the indices computed from these with the 
numbers of observations on which the averages are based are shown for males in 
Table VI and for females in Table VIL A detailed comparison of the mandibular 
characters individually and in the aggregate with the corresponding characters in 
the two series of Anglo-Saxon mandibles from Burwell and from the London 
Museums will be given later (pp. 398 — 404). 
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APPENDIX III. INDIVIDUAL MEASUREMENTS OP MALE ANGLO-SAXON 
MANDIBLES PROM BIDPORD-ON-AVON. 


Serial 
Letter 
or No. 


9,^0 


zt 

c,l 

ml 

V 

rb ' 

nhpi 

h 



rl 

1 M 

IM 


100 

ml 

100 

w 

ml 

100 

xoo 

w 

100 

1/ 

A 

— 

108*0 

104*5 

48*0 

22*0 

102*0 

82*0 

32*0 

29*5 

32*5 

31*0 

8i*o 

67*5 

111*0 

8g*o 

72*5 

79*4 

102*5 

1317 

103*3 

47*4 

F 

123*5 

92*0 

105*0 

45*5 

20*5* 

105*0 

86*0 

34*0 

28*5 

38*0 

29*5 

76*0* 

71*0* 

110*0 

82*0 

67*0 

72*4 

100*0 

107*0 

87*6 

47*9 


— 

— 

— 

48*5 

— 

— 

82*0 

36*5* 

28*0 

31*0 

32*5 

69*0* 

— 

115*0 

— 

60*0 


— 

— 

— 


K 

— 

93*5 

95*0 

— 


101*0 

79*0 

33*0* 

27*5 

32*5 

25*5 

63*0 

— 

115*0 

— 

67*0 

62*4 

94*1 

1x8*4 

98*4 

— 

Y 

125*0 

95*0 

106*0 

45*0 

24*0 

107*0 

79*0 

33*0 

29*5 

35*5 

30*5 

75*0 

70*0 

1160 

79*0 

73*0 

70*1 

99*1 

120*3 

89*6 

47*1 

2 a 

125*0 

99*5 

108*0 

48*0 

22*5 

107*0 

82*0 

36*0 

27*5 

29*0 

28*5 

70*0 

70*0 

ii6*o 

74*0 

66*5 

65*4 

100*9 

121*3 

92*x 

51*4 


114*0 

94*0 

102*0 

44*5 

20*5 

103*0 

76*0 

30*5 

27*5 

32*5 

27*0 

68*5 

6o*o 

120*0 

78*0 

66*0 

66*5 

99*0 

123*7 

92*2 

50*8 

5 

123*0 

96'0 

107*5 

48*5 

21*5 

104*0 

78*0 

32*0 

29*0 

31*0 

27*5 

72*0 

73*0 

115*0 

72*0 

79*0 

69*2 

103*4 

123*1 

89*3 

43*8 

11 

ii6*5 

82*0 

97*5 

40*5 

19*5 

101*0 

75*0 

29*0 

— 

27*5 

— 

63*5 

6o*o 

121*0 

73*0 

59*0 

62*9 

96*5 

109*3 

84*1 

48*3 

48 

— 

81*0 

— 

43*5 

— 

122*0 

76*0 

32*0* 

27*5 

31*5 

28*5 

66*0* 

71*0* 

135*0 

60*0 

— 

54*1 


io6*6 

— 

45*1 

50 

115*5 

85*5 

87*5 

44*0 

— 

101*0 

73*5 

30*5 

29*5 

30*0 

28*0 

66*0* 

68*5* 

n8*o 

68*0 

70*0 

65*3 

86*6 

ii6*3 

97*7 

44*5 


102*0 

92*0 

86*0 

42*5 

— 

110*0 

74*0 

29*0* 

29*5 

30*0 

27*5 

57-0* 

60*0* 

128*0 

6i*o 

66*0 

51*8 

78*2 

124*3 

107*0 

48*3 

60 

— 

— 

— 

43*0 

— 

— 

78*0 

37*5* 

26*0 

31*0 

285* 

62*0* 

— 

1200 

— 

71*0 


— 

— 

— 

— 

66 

130*5 

103*0 

105*0 

50*5 

20*0* 

100*0 

82*0 

32*5 

28*5 

33*0 

29*0 

74*0 

66*5* 

110*0 

88*0 

62*5 

74*0 

105*0 

125*6 

98*1 

48*9 

70 

110*5 

96*0 

88*0 

42*5 

— 

92*0 

61*0 

26*5 

28*0 

26*0 

230 

60*0 

6o*o 

125*0 

69*0 

60*0 

54*3 

95*7 

157*4 

io9*x 

44*2 

78 

108*5 

102*5 

94*0 

44*0 

18*0 

107*0 

77*0 

27*5 

— 

27*5 

— 

57*0* 

57*0 

129*0 

— 

— 

53*3 

87*9 

133*1 

109*0 

48*2 

82 

127*0 

io6‘5 

— 

51*0 

— 

110*0 

77*0 

33*0 

27*5* 

34*0 

31*0* 

65*5 

6o*o 

124*0 

77*0 

68*0 

59*5 

— 

138*3 

— 

55*0 

84 

116*0 

92*0 

97*0 

51*0 

22*5 

112*0 

79*0 

33*0 

30*5 

39*5 

33*0 

— 

62*0 

125 0 

— 

75*0 


86*6 

ii6*5 

94*8 

53*2 

92 

112*5 

90*0 

92*0 

44*5 

18*5 

111*5 

78*0 

34*5 

26*5 

36*5 

35*5 

66*5 

72-5 

120*0 

64*0 

71*0 

57^8 

80*0 

115*4 

97*8 

47*6 

106 

ii8*5 

101*5 

91*5 

43*5 

18*5 

103*0 

77*5 

32*0 

29*0 

29-5 

30*0 

56-0 

64*0 

113*0 

55*0 

68*5 

54*4 

88*8 

131*0 

110*9 

50*0 

107 

120*0 

97*0 

97*0 

49*5 

23*5 

109*0 

86*0 

35*0 

29*0 

34*0 

30*5 

76*0 

71*5 

ii6*o 

73*5 

69*0 

69*7 

89*0 

112*8 

100*0 

49*0 

113 

— 

94*5 

100*5 

46*0 

— 

111*0 

83*0 

34*0 

29*0 

33*5 

27*5 

64-0 

— 

121*0 

— 

68*5 

57*7 

90*5 

113*9 

94*0 

— 

116 

— 

95*5 

99*0 

46*5 

16*5 

112*0 

82*0 

32*5 

28*5 

35*5 

29*5 

— 

67*5 

121*0 

— 

64*5 


88*4 

116*5 

96*5 

48*1 

118 

— 

— 

99*5 

48*5 

18*0* 

113*5 

790 

33*0* 

26*5 

32*5 

30*0 

8o*o* 

71*0* 

123*0 

82*0 


70*5 

87*7 

— 


46*5 

1-7 

— 

103*0 

— 

48*0 

23*5 

105*0 

73*0 

32*0 

27*5 

32*5 

29*5 

— 

66*5 

127*0 

— 

73*5 

— 


141*1 

— 

48*1 

147 

124*0 

102*5 

— 

49*0 

20*5 

106*0 

82*0 

36*0 

28*5* 

32*0 

29*5 

66*0* 

64*0 

1x7*0 

73-0 

69*0 

62*3 

— 

125*0 

— 

56*3 

157 

— 

92*5 

85*0 

42*5 

16*0* 

101*0 

76*0 

32*0* 

26*5 

34*0 

23*0 

55*0* 

52*0* 

124*0 

68*0 

66*5 

54*5 

84*2 

121*7 

io8*8 

61*5 

169 

127*0 

103*0 

103*0 

46*5 

23*0 

107*0 

75*0 

33*0 

31*0 

38*0 

31*0 

64*0 

63*0 

1230 

69*0 

73*0 

59*8 

96*3 

137*3 

100*0 

52*4 

172 

— 

— 

— ' 

45*5 

— 

— 

■ — 

— 

26*0* 

30*0 

28*0* 

63*5* 

— 

— 

— 

73*0 

— 

— 

— 

— 

— - 

183 

102*0 

80*5 

83*0 

44*0 

19*5 

101*0 

74*0 

32*5 

30*5 

28*0 

24*0 

57*0 

64*0 

117*0 

60*0 

68*5 

56*4 

82*2 

108*8 

97*0 

50*8 

185 

112*0 

90*0 

104*0 

47*5 

22*0 

106*0 

82*0 

34*5 

26*5 

31*5 

30*0 

72*0 

1 

70*0 

115*0 

72*0 

69*0 

67*9 

98*1 

109*8 

86*5 

49*3 


* These measurements were taken on the right side. 
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49*0 

24*0 

109*0 

84*0 

37*0 

28*5 

.30*5 

27*5 

65*0 

62*0 

120*0 
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* These measurements were taken on the right side. 
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PART II. THE BURWELL SKULLS, INCLUDING A COMPARISON 
WITH THOSE OP CERTAIN OTHER ANGLO-SAXON SERIES. 

By Doris Layard and Matthew Young. 

CONTENTS. 

PART 11. 

1. Introduction, p. 388. 2. Cranial Material available for InveBtigation and 

Measurement, p. 389. 3. Mean Values and Variabilities of the principal Cranial 

Characters and a Comparison with the corresponding Characters in the Aiiglo>Saxon 
Crania from Bidford-on-Avon and from the London Museums, p. 392. 4. Values of 

a for a selected Series of Characters showing the Features in which Differences in the 
throe Cranial Series are most pronounced : (i) Males, (ii) Females, p. 392. 5, Co- 

efficients of Racial Likeness for the Cranial Series: (i) Males, (ii) Females, p. 396. 

6. Mandibles: Material available for Measurement, p. 398. 7. Mean Values and 

Variabilities of the principal Mandibular Characters and a Comparison with the 
corresponding Characters in the Anglo-Saxon Mandibles from Bidford-on-Avon and 
the London Museums, as well as the Mandibles from Dunstable, p. 398. 8. Values 

of a for a selected Series of Mandibular Characters showing the Features in which 
Differences in the Mandibular Groups are most emphasised: (i) Males, (ii) Females, 
p. 400. 9. Coefficients of Racial Likeness for the Mandibular Series: (i) Males, 

(ii) Females, p. 402. 10. General Discussion and Summary, p. 404. 


APPENDIX TO PART II. 

V. Individual Measurements of the Male Anglo-Saxon Crania from Burwell. 

VI. Individual Measurements of the Female Anglo-Saxon Crania from Burwell. 

VII. Individual Measurements of the Male Anglo-Saxon Mandibles from Burwell. 

VIII. Individual Measurements of the Female Anglo-Saxon Mandibles from Burwell. 

1. Introduction, Burwell lies to the north-east of Cambridge, on the borders 
of Cambridgeshire and Suffolk. The presence of a burial-place in this situation 
was first discovered in 1884, when fourteen skeletons with funerary objects were 
unearthed in the process of working the Victoria Lime Pits which are cut in the 
side of a low chalk hill that rises here out of the Fen. The land adjoining the pits 
is let out to allotment holders and belongs to Dr Charles Lucas, who, in 1925, 
requested the Cambridge Antiquarian Society to undertake excavations. These 
were carried out in five successive seasons under the direction of Mr T. C. Leth- 
bridge, who published interim reports of his results in the Cambridge Antiquarian 
Society 8 Communications, Vols. xxvii, xxviii, xix and xxx, 1926—1929. A 
detailed account of them appears in Recent Excavations in Anglo-Saxon Cemeteries 
in Cambridgeshire and Suffolk*. 

* T. C. Lethbridge. Cambridge Antiquarian Society Quarto Puhlicatiom, New Series, No. III. 
Cambridge, 1981. 
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During the excavations about 125 graves were exposed. The skeletal material 
was deposited in the Anatomy Museum at the University of Cambridge, and the 
grave^goods are preserved in the University Museum of Archaeology and Ethnology. 

The graves were scattered in^egularly about the site, and in no case encroached 
on one another ; Mr Lethbridge suggests that each may have been covered formerly 
by a low mound or barrow, which would have served to mark its position. The 
graves were shallow and cut in the chalk, their depth being determined by the 
thickness of the soil above the chalk. In the majority, the bodies were orientated 
with heads towards the west. The grave-goods were poor both in quantity and 
quality, and are remarkable for the fact that the ornaments are all of Kentish 
workmanship and belong to types known to be prevalent in Kent at the end of the 
Pagan period. 

After reviewing in detail the material from Burwell and comparing it with 
material from other Anglo-Saxon cemeteries in the neighbourhood, Mr Lethbridge 
has come to the conclusion that the Burwell cemetery is that of a Christian Anglo- 
Saxon community of the seventh century. 

We are indebted to Professor H. A. Harris and Dr W. L. H. Duckworth for 
granting permission and providing facilities for the measurement of the skulls. 

2. Cranial MateHal available for Investigation and Measurement. Though the 
skeletons that were excavated were numbered serially from 1 to 140, the number 
of adult crania that were sufficiently complete after reconstruction to permit of some 
at least of their principal measurements being taken was only 67. Of this number 
46 were considered to be male and 22 female. The measurements of the juvenile 
skulls, some of which were considerably distorted, were all excluded from the 
averages. The principal measurements of the individual crania are tabulated in 
detail in the Appendices V and VI, and the mean values of these characters for 
male and female groups of crania are given in Tables I and II. For the male 
series, where the number of crania on which the mean values of characters are 
based is usually in excess of 30 and frequently in the neighbourhood of 40, the 
standard deviations and coefficients of variation of the different characters have 
been calculated ; to these mean values and standard deviations their standard 
errors are appended. As the means of the characters in the female series are, 
with a few exceptions, based on less than 20 observations, the standard deviations 
have not been deemed sufficiently representative for insertion in the table. For 
comparison with the mean values of the cranial characters of these Burwell Anglo- 
Saxons and of the Bidford-on-Avon Anglo-Saxon skulls measured by J. O. Brash 
and described in Part I of this paper, the corresponding means of the Anglo-Saxon 
crania from the London Museums which were measured and described by G. M. 
Moran t^ have been tabulated. 

The Burwell male series of crania may be considered fairly homogeneous, since 
the standard deviations of a selection of the principal characters are not larger than 

* Biometrikaf Vol. xvxir. (1926), p* 56. 
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Anglo-Saaeon Skulls from, SurwfM 
TABLE I. 

Comparative Table of Meane for Male Crania. 


Gharaoters 

Nob 

Bar well 

liondon 

MaBeums 

Bidford’On- 

Avon 

Means 

a 

V 

Means 

Means 

C 

F 

L 

li 

3' 

H' 

OH 

LB 

Q' 

3 

aV, 

^3' 

U 

OH 

Gil 

GB 

J 

NW 

Nil, R 

Nil, L 

NB 
(\\ R 

Ox, R 

0 ,, L 
<>2, R 

L 

Ox 

G^ 

O 2 

OL 

fml 

fmh 

100 BIL 
\iyOBIF 

100 H ’lL 

100 BIH’ 

100 G'HjGB 
100 G'HIJ 

100 NBINII, R 
100 NBjNH, L 
100 NBjNlF 
100 6»2/C>,, R 
100 0^1 Ox, L 
lOOfmblfml 

100 O^lGx 

L.P 

i^N 

LA 

lB 

L 62 

Oe. /. 

40 

44 

45 
45 

44 
40 
37 
40 
37 

30 
42 
42 
42 
42 
42 
42 
42 
42 
34 

31 
31 
24 
34 
34 
34 
34 
12 
30 
33 
30 
33 
11 
14 

29 

33 
37 
37 

45 
44 
40 
40 

30 

1 21 

34 
34 
34 
30 

33 
37 

11 

34 
32 
32 
32 
32 
32 

42 

[1539*12 ± 10*24 ■ 
187*19±0*78 
189*02 ±0*84 
141 *68 ±0*82 
95*22 + 0*51 
130*29+0*91 
114*09+0*81 
102*38 ±0*83 
312*86±1*86 
317*OG± 1*90 
381 *64 ±1*58 
128*38 ±0*94 
130*80+1*13 
122*55 ±1*15 

11 2*69 ±0*73 
117*36 ±0*89 
99*79 + 0*85 
527*76 + 2*01 
115*84±1*14 
09*02 + 0*90 
94*71+1*02 
134*02 ±1*31 
50*15 + 0*60 
50*12+0*62 
50*04 + 0*02 
24*12 + 0*31 
39*67+0*61 
42*02+0*34 
41*85 + 0*30 
32*78+0*42 
32*77 ±0*35 
48*50+ 1 *05 
44*57 ±0*87 
40*41+0*48 
94*11 ±1*01 
37*32 ±0*60 
31*11 ±0*39 
74*76 ±0*47 
75*75 + 0*45 

71 *87 ±0*51 
104*43 + 0*82 
72*46 ±1*07 
51*85+0*88 
48*32 ±0*74 
48*39 ±0*75 
48*28 ±0*54 
78*11 ±1-03 
78*34 ±0*71 
83*84 ±1*29 
82*95 ±1*71 
87“*07±0*66 
63°*30±0*61 
76°*80±0*60 
40'**91 +0*49 
29"*88±0*56 
ir*09±0*64 
58*94 ±0*44 

5*18 + 0*65 
5*60±0*59 
6*50 + 0*58 
3*40 ±0*30 
5*76 ±0*64 
4*95+0*58 
5*26 ±0*59 
11*32 + 1*32 
11*37±1*34 
10*25± 1*12 
0*06 ±0*66 
7*31 ±0*80 
7*44±0*81 
4*70 ±0*52 
5*74 + 0*03 
5*52 + 0*60 
13*05 + 1*42 
6*65 ±0*81 
5*04 ±0*64 
5*68 + 0*72 
6*40 ±0*92 
3*51 +0*43 
3*61 ±0*44 
3*63 ±0*44 
1*80 ±0*22 
2*10±0*43 
1*86 + 0*24 
l*7l±0*21 
2*30 + 0*30 
2*02 + 0*25 
3*49 + 0*74 
3*20 + 0*62 
2*59 ±0*34 
5*79 + 0*71 
3*63 + 0*42 
2*38 ±0*28 
3*16 ±0*33 
3*00 ±0*32 
3*22 ±0*36 
5*19 + 0*58 

5 *88 ±0*76 
4*02 ±0*62 
4*31 ±0*62 
4*39 ±0*53 
4*43 + 0*76 
5*64 ±0*73 
4*06 ±0*50 
7*86 ±0*91 
5*68 ±1*21 
3‘’*24±0*39 
3°*47±0*43 
3‘**37±0*42 
2‘’*77±0*35 
3^*09 ±0*39 
3‘’-62±0*45 
2*83 ±0*31 

2*77 

2*95 

3*88 

3*57 

4*22 

4*34 

5*14 

3*62 

3*59 

2*09 

4*72 

5*59 

0*07 

4*22 

4*89 

5*53 

2*47 

5*74 

7*30 

0*00 

4*78 

7*CX) 

7*20 

7*25 

7*46 

6*29 

4*43 

4*09 

7*02 

6*16 

7*20 

7*31 

6*41 

6*16 

9*73 

7*65 

[1543*3] (31) 
188*9 (94) 
190*6 (68) 
141*7 (103) 
97*3 (59) 
130*0 (31) 
114*9 (17) 
104*1 (31) 

314*4 (15) 
310*1 (23) 

379*8 (64) 
129*3 (83) 
129*0 (87) 
121*7 (71) 

98*5 (37) 
532*0 (73) 

71*7 (2*2) 
95*0 (19) 
133*3 (34) 
52*7 [28) 
52*2 (22) 
52*3 (22) 
24*5 (28) 
40*3 (16) 
42*9 (19) 
42*2 (16) 
33*6 (29) 
33*5 (27) 
50*1 (20) 

41*3 (27) 
96*0 (22) 
37*5 (20) 
31*1 (18) 

74*7 (52) 
74*9 (112) 
71*2 (25) 
104*9 (61) 
75*3 (43) 

48*1 (19) 

47*9 (19) 
47*6 (58) 
77*9 (19) 
79*4 (10) 
82*3 (18) 
81*5 (18) 
SS"*! (16) 

62"*1 (16) 
75‘’*6 (16) 
42“*4 (16) 
30" *6 (13) 
11"*9 (13) 
68*2 (35) 

[1595*7] (11) 
190*3 (19) 
192*8 (19) 
141*7 (19) 
96*6 (24) 
139*7 (11) 
115*6 (9) 
106*2 (12) 

307*3 (15) 
382*4 (12) 
128*1 (20) 
131*0 (18) 
121*7 (14) 
112*7 (24) 
117*1 (19) 
99*8 (11) 
537*1 (11) 
118*0 (12) 
69*5 (21) 
95*3 (17) 
130 0 (12) 
51*7 (19) 

24*1 (19) 

40*4 (14) 
40*5 (19) 
33*4 (14) 
34*1 (19) 
47*4 (18) 

38*9 (18) 
101*9 (11) 
39*0 (12) 
32*1 (12) 
73*5 (19) 
74*1 (18) 
71-7 (11) 
102*2 (11) 
72*4 (17) 

46*9 (19) 
82*8 (14) 
84*2 (19) 
82*5 (12) 
82*3 (18) 
86"*2 (10) 
66“*7 (11) 
73"*6 (11) 
.39"*7 (11) 
28“*0 (7) 
11"*1 (7) 
57*8 (14) 


All length meaenretnenta are in millimetres and capacities in cubic centimetres. 
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TABLE II. 

Comparative Table of Mea/ns for Female Crania, 


OharacterB 

Burwell 

liondon Musearas 

Bidford-on-Avon 

Means 

Means 

Means 

€ 

[1399-1] (13) 

[1370-0] (28) 

[1411-0] (10) 

F 

182*0 (21) 

180*9 (62) 

182*8 (13) 

L 

182*9 (22) 

182*0 (66) 

184*0 (14) 

B 

138-6 (20) 

136*6 (67) 

130*4 (lO) 

B 

95*9 (18) 

94*3 (68) 

95*5 (15) 

fF 

129*3 (13) 

129*6 (28) 

131*9 (7) 

OH 

111*6 (14) 

110*5 (19) 

111*2 (5) 

LB 

95-0 (13) 

97*4 (26) 

99-7 (7) 

Q 

305*1 (14) 

— 

— 


309-7 (14) 

301*7 (18) 

:«)3-3 (6) 

S 

366*5 (19) 

366*5 (38) 

367-1 (5) 


124*9 (20) 

124*4 (63) 

123*0 (12) 

S.2 

]26*() (19) 

123-2 (!i7) 

127*3 (10) 

^3 

llf>*fi (18) 

115-2 (46) 

119*6 (5) 


109*3 (20) 

— 

108*3 (17) 


114*1 (20) 

— 

114*9 (16) 


96*1 (18) 

97*1 (33) 

100*0 (6) 

u 

511*8 (14) 

510*9 (64) 

518*0 (6) 

OH 

109-8 (7) 

— 

110*3 (9) 

(F/l 

65*3 (12) 

05-9 (30) 

65*4 (9) 

GB 

87*2 (13) 

90*2 (2f>) 

91*0 (8) 

,7 

120*4 (5) 

125*6 (26) 

126*2 (3) 

NH' 

47*3 (12) 

47-1 (32) 

49*0 (9) 

Nil, H 

47*3 (12) 

48*4 (29) 

— 

NH, L 

47*3 (12) 

48*0 (27) 


Nil 

22*8 (11) 

24*2 (24) 

23*8 (9) 

Oi, li 

41*0 (8) 

41*6 (23) 

40*0 (8) 

6»,, L 

39*8 (8) 


39*8 (8) 

O 2 , It 

33*3 (8) 

32*8 (27) 

33*5 (8) 

O 2 , L 

33*1 (8) 

32*9 (26) 

33*9 (8) 


46*2 (3) 

51*2 (13) 

45*1 (10) 


43*3 (3) 

47*0 (13) 

— 

G 2 

.37*6(10) 

39*9 (23) 

38*2 (12) 

Gf. 

89*8 (8) 

94 0 (19) 

96-2 (6) 

fml 

36*2 (13) 

35*6 (19) 

36*2 (4) 

fmb 

29*2 (11) 

28*9 (19) 

29-6 (4) 

100 BfL 

75*8 (20) 

74*4 (61) 

73*8 (10) 

lOoi?//^ 

76*3 (19) 

75*2 (77) 

74*2 (10) 

100 H IL 

70*6 (12) 

71*5 (27) 

72*4 (7) 

1(X) BjH^ 

108*4 (12) 

105*7 (42) 

103-6 (6) 

\K^)G*HIGB 

74*8 (12) 

72*1 (41) 

71*5 (8) 

100 G'HIJ 

51*4 (5) 

— 

— 

XmNBjNU, It 

48*3 (11) 

51*0 (20) 

— 

100 NBjNII, L 

48*4 (11) 

51*1 (20) 

— 

100 NB/NH' 

48*6 (11) 

50*2 038) 

48*6 (9) 

100 O 2 IO 1 , H 

81*3 (8) 

79*6 (23) 

83*1 (8) 

100 ojo^, L 

82*2 (7) 

79*4 (24) 

85*2 (8) 

100 finhjfml 

82*5 (11) 

81*7 (18) 

81-8 (4) 

100 G^fG^ 

83*4 (3) 

76*1 (12) 

83*6 (10) 

L. B 

86®*9 (13) 

83" *9 (20) 

83"*1 (5) 

L, N 

65" *9 (8) 

67"*2 (16) 

66" *9 (6) 

L, A 

73"*1 (8) 

72"*6 (15) 

74"*6 (6) 

1, B 

41"*1 (8) 

40"*1 (16) 

38" -6 (6) 


28“*8 (8) 

29"*0 (11) 

31" *2 (3) 

L. $2 

12"*3 (8) 

10"*2 (11) 

6"*3 (3) 

Oc. /. 

59*6 (18) 

59*7 (32) 

61*0 (5) 
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All length measurements are in millimetres and capacities in cubic centimetres. 
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Anglo-Saxon Skulls from Burwell 


those shown by the corresponding characters in the Anglo-Saxon male skulls from 
the London Museums or those of the seventeenth -century Londoners which were 
excavated from a single pit in Whitechapel. A comparison of the values of the 
standard deviations in the three series for the characters selected is shown below : 


Characters 

Anglo-Saxons 

from 

Burwell 

Anglo-Saxons 

from 

London Museums 

17th-century* 

Londoners 

(Whitechapel) 

L 

5-60 


6*27 

F 

5*18 

5*49 

— 

B 

5*50 

6*79 

5*28 

jr 

5*75 

5-81 

5-56 

U 

13*05 

11-79 

15*02 

100 BjL 

3*16 

— 

3*26 

\OOBIF 

3*00 

3*12 

— 


The Burwell female series is not only smaller but rather less homogeneous than 
the male. A casual inspection of the group showed that three of the skulls differed 
from the majority in the respect that they were definitely brachycephalic in type. 

3. Mean Values and Variabilities of the principal Cranial Charactey^s and a Com- 
parison with the corresponding Characters in the Anglo-Saxon Crania, from Bidford- 
on- Avon and from the London Museums, The mean values of the principal calvarial 
and facial characters in the Burwell collection of male and female skulls may be 
compared in detail in Tables I and II with the corresponding values in the two 
other groups of Anglo-Saxon skulls, but the characters in which the three cranial 
scries diverge most notably from one another can be more readily seen from a 
study of Table III, in which the values of 

ng 4 - \ a-g ) 

for the selected series of 31 characters, 19 absolute measurements and 12 indices 
and angles, which are subsequently used for the calculation of the coefficients of 
racial likeness, are given for each pair of male and female groups of skulls. 

The general form of the skulls excavated at Burwell is illustrated by photo- 
graphs in four normae of a typical male specimen (Skull No. 56). Photographs 
of its mandible from two aspects are also given. 

4. Values of a for a selected Series of Characters showing the Features in which 
Differences in the three Cranial Series are most pronounced^. Comparing in the 
first place the Burwell male series with that from the London Museums, it will 
be seen that in only a few of the 31 characters does the value of a exceed 2*7. 
These divergent characters are O'H, B\ NH,R, Oi,22, LB and 100 O^HjOB, In the 

* Jiiomctfiha, Vol. iii. (1904), pp. 191 — 244. 

i‘ The standard deviations used for the calculation of the a’s were those of the long Egyptian series 
of the XXVI — XXX dynasties. Biometrika, Vol. xvi. (1923), p. 828. 
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Plate I 



Typical Anglo-Saxon Male Skull from Burwell (No. 56). 
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Plate II 






Typical Anglo-Saxon Male Skull from Burwell (No. 56). 
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TABLE in. 


Valves of a 


(Ml — between the Several Cramial Series, 

n» -f- \ (Tt / 


OharaoterB 

Males 

Females 

Burwell 

and 

London 

MuBeuma 

Bidford-on- 
Avon and 
London 
Museums 

Burwell 

and 

Bidford'On- 

Avon 

Burwell 

and 

London 

Museums 

Bidford-on- 
Avon and 
London 
Museums 

Burwell 

and 

Bidford-oU’ 

Avon 

100 BIL 

0*03 

2*79 

3*14 

4-30 

0*46 

6-86 

100 H’lL 

100 BjH' 

0*87 

0*22 

0-04 

0*83 

0*65 

1*76 

0*33 

3*67 

2*26 

4*11 

1*53 

5*80 

100 G HIGB 

5*63 

4*16 

0*00 

3*.37 

0*12 

2*60 

\OONBINH, It 

003 

— 


3*64 

— 

— 

100 NBjNH' 

— 

0*35 

1*64 

— 

1*31 

0*00 

100 Og / a , H 
UX ) fmbljf^ml 

0*02 

7*69 

8*27 

0*81 

3*45 

0*62 

0*8i 

0*01 

0*46 

0*14 

0*00 

0*06 

100 G^jG^ 

0-.33 

0*12 

0*07 

2*96 

6*90 

0*00 

Oc. 1, 

0-8H 

016 

1*17 

0*05 

0*71 

0*86 

i^P 

r *04 

4"*93 

2"*66 

8“*16 

0“*29 

5“*99 

L.N 

r *4 o 

12°-59 

8'*64 

O^-BS 

O ’* *04 

0‘’*.32 

L. A 

O '* *08 

r -97 

.3°*31 

0°*11 

l ‘**47 

0‘’*66 

C 

0*02 

1-86 

2*15 

0*89 

1*11 

0*08 

L 

0*77 

2-12 

4*18 

0*57 

2*00 

0*46 

B 

0*00 

000 

0*00 

6*79 

0*27 

1*58 

B 

OH 

W 

6*78 

0*51 

1*86 

2*45 

1*19 

0*09 

0*06 

4*39 

3*94 

0*04 

1*65 

1*61 

LB 

3*20 

2*42 

8*46 

3*31 

2*35 

8*11 

O' 

0*14 

7*19 

10*40 

6*43 

0*15 

2*20 

S 

0*53 

0*44 

0*04 

0*00 

0*01 

0*01 

u 

2*48 

1*31 

3*98 

0*07 

1*97 

1*17 

GH 

5*44 

3*02 

0*18 

0*22 

0*12 

0*00 

J 

0-33 

3*10 

1*53 

0*14 

0*02 

0*14 

NH, R 

0*91 



— 

1*62 

— 

— 

NH' 

— 

1*33 

3*22 

— 

3*75 

2*20 

NB 

0*78 

0 58 

0*00 

5*50 

0*39 

1*84 

Ou a 

3*38 

18*06 

8*76 

0*88 

6-24 

1*64 

O2. R 

2*69 

0*10 

0*94 

0*46 

0*89 

0*06 

Ox 

1*64 

6*23 

0*75 

6*96 

24*00 

0*32 

0^ 

1*64 

8*99 

3*61 

5*76 

3*66 

0*31 

fmh 

0*00 

1*66 

1*96 

0*16 

0-.30 

0*07 

fml 

0*09 

2*77 

4*29 

0*79 

0*34 

0*00 


Burwell series the mean measurements of all these characters are less than in 
the series from the London Museums. On comparing the Bidford-on-Avon males 
with the London Museums males the number of cranial characters in which striking 
divergence is shown is distinctly greater than in the previous case. The characters 
in which the differences in these groups are most emphasised are Oi.i2, 0x^0%, Q\ 
H\ 100 OJOx.R, 100 O'HIGB, 100 BIW, and the angles N and P. Oi,P, G,, G,. Q' 
and 100 O' HjOB, 100 P/if' and the profile angle (ZP) are all smaller in the 
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Bidford-on-Avon than in Morant’s series. The characters H\ 100 and ZiV, 

on the other hand, are greater in the Bidford series. Although the length and 
width of the palate Ot and G% are definitely less in the Bidford than in Morant's 
series, the relative proportion of breadth to length. 100^2/^1, <ioes not differ so 
sensibly in the two groups. While the mean heights of the right and left orbits, 
O9, R and O2, L, show closely corresponding values in the two series, the mean 
measurements of orbital width, Oi, R and 0 %, L, are definitely smaller in the Bidford. 
The mean value for Oi, in the Bidford skull, indeed, almost coincides with that 
tabulated for 0 /, 22 , the breadth measured from the dacryon, in Morant's series. 
As a consequence, the orbital index O2/O1, R in the Bidford group is on the average 
of much the same order as that tabulated for IOOO2/O1, R in Moran t’s series. In 
meiisuring orbital width, J. C. Brash used the curvature method to determine the 
inner terminal of the diameter and not the dacryon, and the accuracy of the 
measurements originally taken have been verified in certain cases from the diopto- 
graphic tracings of the facial normae that were prepared. The notable difference 
in orbital width in the two groups is thus, so far as we are aware, not due to any 
difference in technique but would seem to indicate a real divergence in orbital 
shape. The nasal angle N is on the average nearly . 5 ° larger in the Bidford than in 
the Museums series. The depth of the fi-tce, GL, is in the Bidford male almost 6 mm. 
greater on the average than in the Museums group, and the divergence in this 
measurement appears to be largely responsible for the angular difference. The 
mean breadth of the face, GB, is much the same in the Bidford and Morant s series, 
but the upper face height, G'H, is on the average 2 mm. less in the Bidford. As a 
consecjuence, the upper facial index, 100 G'HJGB, is definitely less in the Bidford 
than in the Museums series. 

The values of a between the selected scries of characters in the Burwell and 
Bidford-on-Avon male groups show clearly that the characters in which the 
Burwell crania differ from the Bidford are largely the same as those in which 
the crania from the London Museums differ from the latter group. The principal 
divergences occur in the characters Oi,i 2 , G2, Q\ LB, H\ L, 100 OzjOxyR and Z N. 
The mean orbital width, Oi, 22 , and orbital index, 100 Oj/Oj, 22 , in the Burwell group 
are in fairly close agreement with the corresponding values in Morant's series, and 
show the same characteristic difference from the Bidford group. Q* is on the 
average about 10 mm, greater in the Burwell than in the Bidford series and 
the basi-nasal length, LB, is nearly 4 mm. less. The depth of the face, GL, in the 
Bidford series is on the average almost 8 mm. greater than in the Burwell series, 
and this divergence, similar to that shown from Morant’s mean value, is chiefly 
responsible for the large size of the Z A" in the Bidford as compared with the 
Burwell cranium. Both the glabella-occipital length ( 2 /) and basi-bregmatic height 
(IT') arc on the average rather greater in the Bidford than in the Burwell group. 

As the mean values of the characters which are tabulated for the female crania 
in the Burwell and Bidford-on-Avon series are usually based on relatively small 
numbers of observations, the values of a between the selected characters in the 
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female groups may be considered in less detail. Comparing the Burwell series 
with Morant's series, the characters in which the values of a seem to indicate the 
most striking divergences are B, O', NB, Ox, 0%, lOOG^/Gi, 100 BIL, 100 BIB7 
and the zP, though in a few other characters a slightly exceeds 2*7. In the 
Burwell group the maximum parietal breadth, B, is on the average 3 mm. greater 
than in the Museums group. This difference in breadth is largely responsible for 
the divergence shown in the mean indices, 100 BjL and 100 BjH*, in both of 
which a exceeds the value 4. On excluding the three definitely brachycepbalic 
skulls, to which reference has been made, the mean cephalic index in the Burwell 
series is identical with that in Morant*s series. Q' in the Burwell group is greater 
than in the Museums group, but Gx, G^ and NB are smaller. The profile angle 
( Z P) is on the average 3° greater in the Burwell female, though in the two male 
groups the mean values agree fairly closely. In the Museums series of Anglo-Saxon 
crania the mean value of this angle in the male is about 4° in excess of that for 
the female. In the Burwell series the mean value of Z P in the male just exceeds 
that in the female. In the long Egyptian series E*, the mean profile angle ( ZP) 
in the male is 85*81 ± *08 and in the female 84*93 ± *09, which would suggest that 
when the number of observations is large the profile angle in the male does not 
differ greatly from that in the female, probably not more than 1®. The large 
sexual difference in the Anglo-Saxons from the London Museums may probably 
be due to the small numbers on which the averages are based and yet, as will be 
seen later, a similar sexual difference, though of lesser degree, is found in the 
Bidford-on-Avon group. The difference may really be indicative of relative 
prognathism in the female Anglo-Saxon, though it is not evident in the Burwell 
type. 

A scrutiny of the values of a for the selected characters in the Museums and 
Bidford-on-Avon females shows that the only features in which differences that 
may be considered significant occur are Gx, G^, Oi,P, NN', 100 0s/Oi,P and 
IOOG 2 /GX. The mean palatal length and breadth, Gx and (? 2 , in the Bidford 
group are apparently smaller than the corresponding values in Morant*s scries, 
and as the length is relatively more in defect than the breadth, the palatal index, 
100 GtjGi, is definitely greater in the Bidford series. The orbital width, Oi, It, is 
less in the Bidford than in Morant’s group and the orbital index apparently greater. 
The mean value of Nir in the Bidford group, on the other hand, is greater than 
in the Anglo-Saxons from the London Museums. 

Comparing the Burwell and Bidford-on-Avon females, the only values of a 
which seem to indicate significant differences in the corresponding characters are 
those for LB, 100 BjL, 100 BjH^ and .d P. The basi-nasal length, LB, in the 
Bidford female is on the average very large and exceeds the mean value for 
the corresponding character in the Burwell group by nearly 5 mm. A like 
difference was found in the male crania. The mean profile angle ( z P) in the 
Bidford female is based on only five observations. The individual angles in this 
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group were measured on the dioptographic tracings of the normae laterales and 
not directly by the stationary goniometer, but this is not likely to have influenced 
very materially the estimation of the size of the angle, as both methods when 
tested on two or three male skulls gave almost identical results. The mean value 
of the Z P, as has been mentioned, is in close agreement with that for the Anglo- 
Saxons from the London Museums which, though probably a smaller value than 
would be found if it were based on a large number of observations, may indicate 
a relative tendency to prognathism in the Anglo-Saxon female. In the Burwell 
group the mean maximum breadth is greater, and the mean maximum length and 
mean basi-bregmatic height less than in the Bidford. As a consequence the in- 
dices 100 BJL and 100 BjH' in the Burwell group both exceed the corresponding 
indices in the Bidford group to a greater degree than might be expected to occur 
fortuitously. 

5. Coefficients of Racial Likeness for the Cranial Series, Having enumerated and 
discussed the characters for which the mean measurements in the groups of skulls 
seem to show differences of such a degree that they may probably be considered 
significant, we now proceed to consider the coefficients of racial likeness computed 
between each pair of groups of skulls for the 31 selected characters. The coeffi- 
cients, crude and reduced, are shown for males in Table IV and for females in 
Table V. I'he reduced coefficients are relatively comparable as they have been 
brought to a common standard by making allowance for the varying number of 
skulls on which the crude coefficients are based. 

The crude coefficients for the male groups seem to indicate quite clearly that 
in shape and in general characters the Burwell crania are almost identical with 
the Anglo-Saxon crania from the London Museums. Both these groups show a 
slight divergence, almost equal in degree, from the Bidford-on-Avon crania. As 
the mean number of skulls on which the Burwell and Museums coefficients are 
based is much the same, the adjustment for varying numbers of skulls does not 
change the relationship of the Bidford-on-Avon group to these two groups. The 
reduced coefficients of racial likeness between the Bidford-on-Avon male group 
and the Burwell and Morant’s series respectively are, however, about six times the 
size of the coefficient between these two series. 

The crude coefficients of racial likeness for the female groups, Table V, suggest 
that in regard to shape-characters (indices and angles) the Burwell series diverges 
only slightly and to an equivalent degree from the two series from Bidford-on-Avon 
and the London Museums. In its characters, generally, the Burwell series seems 
to resemble the Bidford-on-Avon series rather more closely than that from the 
London Museums. The Loudon Museums cranium is almost identical in shape 
with the Bidford-on-Avon cranium, and diverges only slightly from it when com- 
parison is made between all the 31 characters under review. 

Though a comparison of the reduced and relatively comparable coefficients 
which are given in Table V suggests that in their general form the female Anglo- 
Saxons from the London Museums may resemble the Bidford-on-Avon crania 
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TABLE IV. 

Coefficients of Racial Likeness between the Male Cranial Series, 


CribdU Coefficients 


Burwell (84-8) 

London Masenms (85*8) 

Bidford-on-Avon (14*8) 

All 

Oharaoters 

Indices and 
Angles 

All 

Characters 

Indices and 
Angles 

All 

Characters 

Indices and 
Angles 

Burwell (34*8) 



0*66 ±*17 

-0*05 ±*28 

1*96±*17 

1*64 ±*28 

London MuseutUH (35*3) 

0*66±*17 

-0*05+ *28 

• — 

— 

2*27 ±*17 

2*21 ±*28 

Bidford-on-Avon (14*8) 

1*96±*17 

1*64 ±*28 

2*27 ±*17 

2*21 ± *28 

— 

— 



Reduced 

Coefficients 




Burwell (34*8) 





-0*16± *91 

9*49 ±*82 

8*32±1*42 

London Museums (35*3) 

1*63 ±*49 


— 

— 

1 0*88 ±*82 

11*74±1*45 


9*49 ±*82 

8*32 ±1*42 

10*88 ±*82 

11-74±1*45 

— 

— 


TABLE V. 

Coefficients of Racial Likeness between the Female Cranial Series, 


Crude Coefficients 


Burwell (12-1) 

London Museums (80*6) 

Bidford-ou-Avon (7*6) 


All 

Indices and 

All 

Indices and 

All 

Indices and 


Characters 

Angles 

Characters 

Angles 

Characters 

Angles 

Burwell (12*1) 



1*33 ±*17 

1*44 ±*28 

0*51 ±*17 

1*08 ±*28 

London Museums (30*5) 

1*33±17 

1*44 ±*28 

— 

— 

1*16±*17 

0*40 ±*28 

Bidford-on-Avon (7*6) 

0*51 ±*17 

1*08 ±*28 

1*16±*17 

0*40 ±*28 

— 

— 



Reduced 

^oejfficiente 




Burwell (12*1) 





7*73 ± *99 

911±1*77 

5*46 ±1*82 

12*49±3*24 

London Museums (30*5) 

7*73 ± *99 

9*11±1*77 

— 

— 

9*54±1*40 

3*68 ±2*50 

Bidford-on-Avon (7*6) 

5*46 ±1*82 

12*49 ±3*24 

9*54 ±1*40 

3*58 ±2*50 

““ 

— 


more closely than they do the Burwell or than the Burwell resemble the Bidford- 
on-Avon group, it is obvious fix)m the size of the probable errors of the coefficients 
that little emphasis can be laid on such differences as are observed. The coefficient 
for all characters between the Burwell and Bidford-on-Avon groups is rather less 
than that between the Burwell and the London Museums groups, and the latter is 
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less than the coefficient between the Bidford-on-Avon and the Museums groups, 
but it is again apparent from the size of the probable errors of the coefficients 
that no real differences in degree of resemblance can be postulated on such data as 
are available. 

6. Mandibles : Material available for Measurement The mandibles belonging 
to the Burwell skulls were also measured in accordance with the scheme of 
measurement now followed at the Biometric Laboratory, University College, 
London. 02 specimens were found to be sufficiently complete, though reconstruc- 
tion was necessary in many cases, to provide measurements of the majority of the 
characters that are usually brought under review. Of these 42 were of the male 
and 20 of the female sex. The measurements of the individual mandibles are 
tabulated in Appendices VII and VIII. 

7. Mean Values and Variabilities of the principal Mandibular Characters and 
a Comjiarison with the corresponding Characters in the Anglo-Saxon Mandibles from 
Bidford-on-Avon and the London Museums, as well as the Mandibles from Dunstable, 
The mean values of the characters measured and the indices computed from these 
for the Burwell and Bidford-on-Avon series are shown for males in Table VI and 
for females in Table VII. For comparison with the male series of mean values, the 

TABLE VI. 

Comparative Table of Means for Mandibles of Male Skulls, 


Character 

No. 

Burwell 

London 

MuBenms 

Bidford- 

on-Avon 

Dunstable 



Mean 

a 

tf 

Mean 

Mean 

Mean 

?ri 

35 

122*44 ±1-07 

0*32 ±0*76 

5-16 

123-7 (25) 

117*7 (20) 

121-0 (29) 


41 

98-82+ 1*02 

0-56 ±0*72 

0*04 

100-4 (33) 

95*1 (27) 

98-4 (40) 


37 

98*81 +0-79 

4*83 ±0*50 

4*89 

100*3 ^27) 

97-4 (24) 

99*3 (27) 

z::; 

42 

45*75 + 0-33 

2*17±0*24 

4*74 

45*3 (57) 

46*1 (30) 

45*4 (42) 


39 

21 19 +0*34 

2* 13 ±0*24 

10-05 

21*7 (38) 

20-5 (21) 

21*3 (35) 

ml 

42 

105*55 + 0-93 

6*01 ±0-66 

5*69 

107*2 (31) 

106-1 (28) 

106*8 (34) 


42 

78-12±O-80 

5*21 ±0*57 

6-07 

77-7 (42) 

78-1 (30) 

76-2 (38) 

rh' 

42 

32-07 ±0-49 

318±0*35 

9-92 

33*2 (61) 

32-6 (30) 

32-8 (42) 


39 

28-49 ±(>*20 

1*24±0*14 

4-35 

28*1 (59) 

28*3 (29) 

28*0 (39) 

K 

40 

31 *05 ± 0*38 

2*41 ±0*27 

7*61 

33*1 (40) 

32-2 (31) 

32*5 (39) 

m2/i 

39 

27*42 ±0*38 

2*39 ±0*27 

8-72 

27*2 (51) 

28-9 (29) 1 

26*6 (41) 

c^h 

40 

64-80+ 1-03 

6*51 + 0*73 

10*05 

65*7 (48) 

66*6 (28) 

66-0 (38) 

rl 

42 

63*04 ±0*99 

0-39 ±0-70 

10*04 

64*0 (45) 

65-5 (26) 

64*7 (38) 

L M 

42 

120"11±1-15 

7"*43±0-81 

— 

120" -3 (47) 

119"-7 (30) 

120" -9 (39) 

L. It 

39 

7r'*01 +1*21 

7"*57±0*86 

— 

72"-0 (36) 

72"*1 (22) 1 

68"-7 (37) 

JL C 

40 1 

60‘'-74±102 

6“-44±0*72 

— 

68" *2 (32) 

68"*4 (28) i 

69"*0 (36) 


40 

61 *66 ±1*09 

0*88 ±0*77 

— 

60*9 (27) 

62*9 (26) 

59-3 (31) 

1(K) 

37 

94*04±1*15 

7*02 ±0*82 

— 

94*4 (15) 

92*6 (24) 

93*6 (26) 


41 

127*19±l-76 

11*25±1*24 

— 

129-0 (32) 

122*6 (27) 

129*7 (38) 

g off 

30 

100*11 ±1*21 

7*29 ±0*86 

— 

99*3 (19) 

97*6 (23) 

100*0 (27) 

100 rVjrl 

42 

50*72 ±0*88 

5*73 ±0*63 


51*5 (45) 

49*4 (26) 

61*0 (38) 
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means of the corresponding characters in the mandibles of the Anglo-Saxon skulls 
in the London Museums which were measured by G. M. Morant* and of the 
Dunstable mandibles f which were known to be very similar to this series of 
Anglo-Saxon mandibles are also tabulated. For the mandibular characters of the 
Burwell group of males, the standard deviations and coefficients of variation are 
also given, as the numbers of observations on which these are based are usually 
about 40. The standard errors of the means and standard deviations are also 
included in the table. For female mandibles, the means of the characters and the 
numbers of observations on which these are based are only given for the three 
groups from Burwell, Bidford-on«Avon and the London Museums. The small 
number of female mandibles from Dunstable did not seem to warrant even the 
calculation of mean values. 

The means of the corresponding characters for males in the several groups can 
be compared in detail in Table VI, but the characters amongst a selected series in 
which the male mandibles of one group differ most strikingly from those of the 
other groups can be seen more readily by reference to Table VIII in which the 
values of a between the characters for each pair of groups are given. The selected 


TABLE VII. 

Comparative Table of Means for Mandibles of Female Skulls, 


Cbaraoter 

Burwell 

London Mueeums 

Bid ford- on- Avon 

Mean 

Mean 

Mean 

W’l 

115-9 (16) 

116*6 (22) 

114*1 (H) 

</o(h 

91*7 (19) 

92-9 (36) 

90*8 (16) 

CrC, 

92-9 (16) 

93-2 (28) 

94*0 (16) 

zz 

44-1 (20) 

44*1 (50) 

46-1 (19) 

Cyl 

19-2 (19) 

19*1 (35) 

20*4 (13) 

ml 

99-9 (20) 

104*2 (46) 

102*8 (17) 

Cpl 

73*4 (20) 

.30*0 (20) 

74-6 (49) 

76-8 (18) 

rb' 

31-0 (66) 

31*7 (19) 

m2 Pi 

27*9 (18) 

27-6 (57) 

27*8 (18) 

hi 

29*2 (18) 

30-6 (31) 

29*4 (19) 

m2h 

26-3 (18) 

24-4 (62) 

26-8 (18) 

Cyh 

58*4 (18) 

69‘2 (47) 

62*0 (18) 

rl 

57-3 (19) 

69-1 (46) 

61*5 (1.5) 

lM 

122'’*6 (20) 

122°'6 (49) 

119'’*9 (18) 

Lit 

69"*5 (16) 

68”'2 (36) 

69"*5 (14) 

lC* 

67”’8 (19) 

70°-6 (32) 

69“-8 (17) 

XODc^hjml 

58-5 (18) 

68*3 (38) 

60*3 (17) 


93*4 (16) 

91 *7 (26) 

90*2 (14) 

^00 ffof^olOpl 

125-7 (19 

126*2 (36) 

118*6 (16) 

100 fToffol^r^r 
100 rb'/rl 

99*2 (15) 

99*3 (23) 

97-1 (14) 

53-8 (19) 

63-0 (4:1) 

52-1 (16) 


* Biometrikuy Vol. xvin. (1926), p. 96. 
t Tbid.y Vol. XXV. (1988), pp. 147—157. 
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TABLE VIII. 


Values of a 


n, rv /Ma - 

rig -f ng \ cTg ) 


between the Several Male Mandibular Series. 


Character 

Burwell 

and 

London 

Museums 

Burwell 

and 

Bidford- 

on-Avon 

Loudon 
Museums 
and Bidford- 
on-Avon 

Burwell 

and 

Dunstable 

London 

Museums 

and 

Dunstable 

Bidford- 

on-Avon 

and 

Dunstable 

Wy 

0-75 

8-.57 

12-20 

0-95 

2-98 

3-93 

zz 

0*85 

()*22 

1-76 

0-47 

0-03 

1*20 

rl 

0-14 

2-42 

1*54 

1*01 

0-42 

0*41 

rV 

5-40 

0-79 

1-30 

1-85 

0*71 

0*13 

Cyl 

1 -.-is 

1-85 

5-40 

0*05 

0*81 

2-33 

ml 

1*89 

0*17 

0-74 

1*12 

0-11 

0-31 

K 

4*30 

0*48 

1-55 

1-39 

0-78 

0-17 


1-26 

0*22 

0*26 

1-63 

0 08 

0-50 

Lit 

0 "* 2 « 

0"*26 

o“-oo 

r -51 

2"*99 

2"*40 


0*56 

3*44 

5-91 

1-18 

0*08 

7-82 


list of mandibular characters in Table VIII is that used by Q. M. Morant for the 
calculation of the coefficients of racial likeness between mandibular groups and 
we are indebted to him for giving it to us. These characters have been selected 
by this author because of their relatively slight correlation with one another and 
because of the relative frequency with which they differ significantly in a series 
of racial types that were studied by him. 

8. Values of a for a. selected Series of Mandibular Character's showing the 
Features in tohich Differences in the Mandibular Groups are most emphasised'*^. For 
the Burwell and London Museums series of male mandibles the only two characters 
of the ten tabulated in which a exceeds 2*7 are r6' and hx- Both these characters 
are smaller on the average in the Burwell series than in Morant s series. 

Comparing the Burwell and Bidford-on-Avon gioups, two characters, Wx and 
100 gogojcply alone show values of a exceeding 2*7. For the bicondylar width, Wxt 
the value of a exceeds 8. In the Bidford series this character is almost 6 mm. less 
on the average than in the Burwell series; the mean value of \00 gogolcpl in the 
Burwell group is rather less than in the mandibles from Bidford-on-Avon. 

In the London Museums and Bidford groups the values of a for three characters 
exceed 2*7. These arc Wx, Cyl and lOOgogolCpl. The Bidford-on-Avon mean value 
of Wi is about 6 mm. less and 100 go9olCpl almost 5 units less than the corresponding 
values for the mandibles from the London Museums ; the Bidford series thus holds 
in respect of these two characters a similar relationship to the Burwell and 
Museums groups. Reference to Table VI shows that the mean value of Cpl is of 
much the same order in the three groups, but that the mean value of g^^go is 4 to 

* The standard deviations used in the calculation of the a’s for the mandibular characters are those 
of the Qau Egyptian series of the IVth dynasty which Dr G. M. Morant has kindly given to us. 
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6 mm. less in the Bidford group than in the other two ; the difference in bigonial 
width is thus almost wholly responsible for the difference in the index. The 
average condylar length, Cyl, is rather less in the Bidford-on-Avon mandible than 
in that from the London Museums. 

For the Burwell and Dunstable groups, all the values of a are under 2’7. 

For the London Museums and Dunstable series only two characters, Wx and Z. iJ, 
have values of a exceeding 2*7, but in neither case does the value exceed 3. The 
mean values of these two characters in the Dunstable mandible are rather less 
than the corresponding values in that from the London Museums. 

In the Dunstable and Bidford-on-Avon series the values of a for two of the 
ten characters, namely Wx and lOOg^ggjCpl, are in excess of 2*7. The value for the 
index is almost equal to 8. The mean values of these two characters are both less 
in the Bidford group. The relatively narrow bicondylar width, Wx^ in the Bidford 
group has already been commented upon. It is about 3*5 mm. less on the average 
than in the Dunstable group ; the bigonial width, gogoi shows a deficiency of about 
the same order. The mean body length, Cpl, on the other hand, is rather greater 
than in the Dunstable mandible. The relatively small bigonial width is thus again 
mainly responsible for the divergence in the index. 

The female mandibles may be dealt with more briefly. The values of a for the 
selected characters in the three Anglo-Saxon series are given in Table IX. 

Comparing the characters in the mandibles from Burwell with those from 
the London Museums, only one value of a, that for the mandibular length, ml, 
appreciably exceeds 2*7 ; the value is actually higher than 10. This character is 
definitely smaller in the Burwell group. The value of a for the rameal length, rl, 
just exceeds 2*7 ; in this case also the Burwell measurement is the smaller. 


TABLE IX. 


Values of a = - V between 

flti "f* 'fig* \ Cg / 


the Several Female Mandibular Series. 


Character 

Burwell 

and 

London MuBeums 

Burwell 

and 

Bidford-on-Avon 

London MuseumH 
and 

Bidford-on-Avon 

Wi 

0-16 

0-73 

1*58 

zz 

0*00 

5*72 

8*09 

rl 

3*01 

10*27 

4-50 

rh' 

0-46 

2*31 

1*36 

Cyl 

007 

6*18 

8-90 

ml 

10*62 

3*21 

1*00 

h\ 

2*60 

005 

1*93 


0*87 1 

0*06 

0*39 

LR 

0"*52 

0“*00 

0°*47 


0*03 

1 

3-81 

1 

6*51 


Biometrika xxvn 


26 




402 Anglo-Saxon Sktdh from Burwell 

Comparing the Burwell and Bidford-on-Avon groups, five of the ten characters 
have values of a exceeding 2*7. These are zz, rl, Cyl^ ml, and 100^o^„/Cpi. All these 
dimensions except the last are smaller in the Burwell mandible. The index, 
100 gogojopl is definitely greater in the Burwell group and the divergence seems 
to depend on a relatively smaller body length, Cpl. 

Comparing the mandibles from the London Museums and Bidford-on-Avon, 
four of the ten characters under review have values of a in excess of 2*7. These 
are zz, rl, Cyl, and lOOgogojcpL All of these characters except the last are smaller 
in the mandible from the Museums; 100 gogol(^pl is greater. The relationship of 
the mandible from the Museums to that from Bidford in respect of these four 
characters is thus the same as that of the Burwell mandible. The relatively 
greater index, 100 gogolcph in the Museums mandibles than the Bidford appears 
to depend about equally on its two components as the mandibular body length, 
Cpl, in the Museums series is as much in defect of the Bidford-on-Avon value as 
the bigonial width, gogo^ is in excess of it. 

9. CoefficientH of Racial Likeness for the Mandibular Series, Having described 
the characters amongst those selected for computation of the coefficient of racial 
likeness in the mandibles in which differences between the groups were most 
apparent, attention may now be directed to the coefficients of racial likeness, crude 
and reduced, for the groups of male mandibles in Table X. 

TABLE X. 


Coefficients of Racial Likeness between the Several Male Mandibular Groups, 


Crude Coefftcientu 


Burwell 

iiondon Museums 

Bidford-on-Avon * 

Dunstable 


(40-1) 

(42-4) 

(26-4) 

(37*3) 

Burwell (40‘1) 


0-68+ -30 

0*84+ -30 

0-12+ *30 

London Museums (42*4) 

0*08 ± -30 

— 

2*07 + -30 

-0*10+ *30 

Bidford-on-Avon (20*4) 

0*84 + *30 

2-07 + -.30 


0*92+ -30 

Dunstable (37*3) 

0-1 2 ±-30 

-0*10 ±-30 

0-92 ±*30 

— 


Red 

\uced CoeffioientR 



Burwell (40*1) 

1 

1 *65 ± *73 

2*64 ±*94 

0*31 + -78 

London Museums (42*4) 

i«r)±-73 

— 

6-36 ± *92 

- 0*2.5 ± *76 

Bidford-on-Avon (20*4) 

2-64 ± *94 

e-36±*92 



2*98 ± -97 

Dunstable (37*3) 

0-31 + *78 

--0*2.'3±-76 

2*98 ±*97 

— 


The values of the crude coefficients seem to indicate that the Burwell male 
mandibles, at least so far as this group of selected characters is concerned, cannot 
be said to differ from those in the London Museums or from Dunstable, but may 
show a slight divergence from the Bidford-on-Avon mandibles. The last coefficient 
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of racial likeness almost attains the conventional level of significance. The type 
of mandible in the London Museums is identical with that from Dunstable, but 
diverges appreciably from that found at Bid ford-on- A von. The Bidford-on-Avon 
mandibles show a sensible divergence from those excavated at Dunstable. 

The reduced coefficients of racial likeness which are also given in Table X 
indicate the relative degrees of relationship between the different groups. The 
coefficient between the Burwell mandibles and those from the London Museums 
is almost five times that shown between the Burwell and Dunstable groups, but 
this difference in value cannot be said to connote any real difference on the data 
available ; both coefficients are statistically insignificant. The coefficient between 
the Burwell and Bidford-on-Avon mandibles is almost nine times that shown 
between the Burwell and Dunstable mandibles, and, as has been mentioned, may 
possibly be taken as indicative of a real divergence in the two Anglo-Saxon groups. 

The coefficient between the mandibles from the London Museums and those 
from Bidford-on-Avon is fully twice that found between the mandibles from 
Burwell and the latter place. The Burwell mandibles would appear to be definitely 
more closely related to the Bidford-on-Avon mandibles than are those in the 
London Museums. The relationship of the Burwell mandible to the Bidford-on- 
Avon mandible is of the same order as that shown between the Bidford-on-Avon 
and Dunstable mandibles. The Dunstable mandibles appear to be identical in 
type with the Anglo-Saxon mandibles from Burwell and from the London Museums. 

The coefficients of racial likeness, both crude and reduced, for the female 
groups of mandibles are given in Table XI. As the mean numbers of mandibles 
on which the coefficients are based are almost the same in the Burwell and Bidford- 
on-Avon groups, the respective crude coefficients between these groups and the 
group from the London Museums are relatively comparable without adjustment. 

TABLE XL 

Coefficients of Racial Likeness between the Several Female Mandibular Croups. 


Crude Coefficients 


Burwell 

London Museums 

Bidford-on-Avon 


(18-6) 

(41-2) 

(16-1) 

Burwell (IB’5) 


0-83+ -30 

2-23 ±-30 

Loudon MuseuiiiH (41 '2) 

0-83-1- -30 

— 

2*37 ±-30 

Bidford-on-Avon (16*1) 

2-23 ±*30 

2-37 ± -30 



Reduced Coel^ 

Heients 


Burwell (18*6) 

__ 

3-25 ±1-17 

12-95 ±1*74 

London MusoumB (41-2) 

3*26 ±1*17 

— 

10-24 ±1-30 

Bidford-on-Avon (16*1) 

12-96 ±1-74 

10-24 ±1-30 
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The coefficient between the Burwell mandibles and the mandibles from the London 
Museums is less than unity and statistically insignificant, although it almost attains 
the conventional level of significance. That between the Bidford-on-Avon and 
Museums mandibles is significant and nearly three times as large as the other. 
The crude coefficient between the Bidford-on-Avon and Burwell groups is also 
significant and of about the same size as that between the mandibles from Bidford- 
on-Avon and the London Museums. 

Adjustment of the coefficients to the values they would have, if based on 100 
mandibles in the groups compared, results in a reduced coefficient between the 
Burwell mandibles and the mandibles from the London Museums of a value 
exceeding 3. The reduced coefficients between the Bidford-on-Avon mandibles 
on the one hand and the mandibles from Burwell and the London Museums on 
the other have values of about 13 and 10. These do not differ from one another 
to a significant degree, but are of the order of magnitude of reduced coefficients of 
racial likeness which are occasionally found between groups of mandibles belonging 
to crania of distinctly divergent racial type. On the other hand, two groups of 
crania may show considerable divergence in type while the corresponding groups 
of mandibles exhibit almost identical characters. Such was indeed found to be 
the case on comparing the Dunstable material with the Anglo-Saxon material in 
the London Museums. 

10, General Discussion and Summary, The cranial and mandibular characters 
of the groups of Anglo-Saxon skulls from Burwell, Cambridgeshire, and Bidford-on- 
Avon, Warwickshire, have been compared both individually and collectively in 
considerable detail and both groups have been compared in respect of the same 
characters with the Anglo-Saxon skulls in the London Museums which were 
measured and described by Q. M. Morant*. The comparison of the male crania 
in the two first-mentioned groups is the more important as these are almost twice 
as numerous as the fem.ale specimens. The main inferences to be drawn from 
the comparison of these male groups appear to be that the Burwell skulls, both 
crania and mandibles, are almost identical in type with the skulls from the London 
Museums. If material divergence is present in any feature, it occurs in the size of 
the upper face and its proportions. Both these groups of Anglo-Saxons seem to 
show, however, some divergence, about equivalent in degree, from those excavated 
at Bidford-on-Avon. Where divergence in these types is found, however, it is seen 
to occur mainly in those characters or features, into the accurate measurement of 
which the personal element largely enters and in which slight variance of technique, 
such tis identification of terminals of diameters or arcs, on the part of different 
observers might probably produce average differences of the order that are found 
in the present case. Among such characters are the orbital width Oi, R, and, as 
a consequence, the orbital index lOOOa/Oi, R, the palatal length (?i, the palatal 
breadth 6?a, and the transverse arc Q\ The nasal height, NH, R or NH' is also 
such a character although it shows no divergence in the types under review. In 
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only one or two of its other characters, amongst those selected for comparison, and 
notably the nasal angle (N), does the Bidford cranium differ sensibly from the 
other two types. Comment has already been made on the relatively large mean 
measurement of the facial depth (OL) in the Bidford cranium as compared with 
that shown in the other types and the influence of this chord on the size of the 
angle N. The differences in the recorded mean measurements of these characters 
may be real but some caution is advisable in accepting them as of the order of 
magnitude found, when, as in the present case, the measurements in the different 
series have been taken by different observers. Fairly substantial differences in 
certain characters might probably be expected in Anglo-Saxon cranial material 
from different localities. Reference has been made to the probable existence of 
two distinct types, recognizable on general inspection, amongst the male crania 
from the Bid ford -on- Avon site. Morant* on the other hand distributed the Anglo- 
Saxon skulls in the London Museums according to their place of origin into the 
four groups : West Saxons, South Saxons, Angles and Jutes, and came to the 
conclusion from a comparison of the actual mean mefisurements of the principal 
characters in these groups and the coefficients of racial likeness between them 
“ that they represent populations which are extremely similar if not absolutely 
identical. Only one C.R.L. — that between the Angles and West Saxons — suggests 
any real difference of type.” 

Because of the relatively small numbers of female skulls in the Burwell and 
Bidford-on-Avon groups, very slight reliance can be placed on the mean values of 
the characters which are tabulated for these being truly representative of the types, 
but it is of interest to note in what respects, if any, the data available corroborate 
the conclusions that have been drawn from a comparison of the male series. The 
number of female crania from the London Museums is definitely greater and 
should provide more reliable means. The Burwell female, unlike the male, differs 
appreciably in many of its characters from that in the London Museums. We have 
referred to the presence of a few distinctly brachycephalic skulls in the Burwell 
group. These are mainly responsible for the relatively great mean maximum 
breadth (B) in the Burwell female and its divergence in this character and in the 
indices of which it is a component, 100 B/i and 100 BjH\ from the female Anglo- 
Saxons in the Museums. The minimum frontal breadth (JS') is also significantly 
greater in the Burwell female skull. Significant differences are also shown in the 
palatal length and breadth, the palatal index, the nasal breadth, the nasal index, 
the transverse arc (Q') and the profile angle ( Z P). The mean differences are so 
many and so relatively large that they suggest the female crania from the two 
sources are really of different type. 

The Bidford-on-Avon female cranium differs from that of the London Museums 
in its orbital, nasal and palatal measurements, the characters in which differences 
may moat readily arise from slight differences in technique as has been mentioned 
in the comparison of the male skulls. In their general form, these two female 
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crania would appear to be almost identical, but when all (the selected) characters 
are considered, they seem to show a divergence in type of about the same degree 
as that found in the corresponding male groups. 

From the Burwell female cranium, the Bidford-on-Avon cranium differs mainly 
in having a relatively smaller maximum parietal breadth {B) with, as a con- 
sequence, smaller length-breadth (100 JB/X) and height-breadth (100 indices; 

a greater basinasal length and a smaller profile angle ( P). When all the shape 

characters (indices and angles) are considered, the two crania show a divergence 
in type, and when the comparison embraces all the selected characters some real 
divergence in type still seems to be indicated. The relatively small profile angle 
( Z P) in the Bidford-on-Avon female and also in the female Anglo-Saxon from 
the London Museums as compared with that in the Burwell female, and the 
possibility of the existence of a relative prognathism in the Anglo-Saxon female, 
although it is not revealed in the specimens from Burwell, has already been 
discussed. 

The data for the female Anglo-Saxon crania thus seem to confirm the existence 
of some divergence in type between the Bidford-on-Avon and Burwell, and Bidford- 
on-Avon and Museums groups which was suggested by a comparison of the cor- 
responding male groups. Comparison of the female crania from Burwell and the 
London Museums would also suggest some diversity in these types rather than 
the identity which seemed to be established by a study of the characters in the 
male groups, but little importance can be attached to the apparent diversity of 
type in the female groups in view of the heterogeneity known to exist in the 
female crania from Burwell and their small number. 

Turning to the mandibular relationships, there seems to be evidence that, so 
far as the selected characters are concerned, the Burwell male mandible does not 
differ significantly from the Anglo-Saxon mandible in the London Museums or 
from that of the specimens excavated at Bidford-on-Avon, although in the 
latter case the coefficient of racial likeness almost attains the conventional level 
of significance (three times p.e.). The male mandible for the London Museums, 
on the other hand, differs sensibly in the selected features from that found at 
Bidford-on-Avon. 

While the Burwell female mandible cannot be said to differ certainly from the 
female Anglo-Saxon mandible in the London Museums, although the coefficient 
of racial likeness between them is on the border line of significance, both the 
female Anglo-Saxon mandible from Burwell and that described by Morant appear 
to differ significantly from the Bidford-on-Avon type. The reduced coefficients 
of racial likeness in the last two cases are as large as are those occasionally found 
between samples of mandibles that belong to crania of quite diverse racial type. 

In a general way it may be stated that the study of the mandibles confirms 
largely the evidence obtained from the crania as to the interrelationships of the 
three groups of Anglo-Saxon skulls. 
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The apparent identity of type that exists between the Dunstable mandibles, 
on the one hand, and the Anglo-Saxon mandibles from Burwell and from the 
London Museums, on the other, is of special interest as the corresponding crania 
when examined were found to be distinctly divergent in type. 

We have provided records of a considerable amount of unpublished Anglo- 
Saxon cranial material and have ventured to draw some inferences from the data 
we have studied as to the probable relationships of the cranial samples from 
Bidford-on-Avon, Burwell and the London Museums, but we fully recognise, and 
are ready to acquiesce in the view, that a final expression of opinion on real 
diversity or identity of Anglo-Saxon groups from different localities will only be 
possible when more material is available for study and when all the evidence 
obtainable from this material, including that provided by cranial contours of the 
several normae, is utilised. 




ON THE NUMERICAL EVALUATION OF HIGH ORDER 
INCOMPLETE EULERIAN INTEGRALS. 

By KARL and MARGARET V. PEARSON. 


I. The Incomplete V-Function, 

The Incomplete F-Function is defined by 

f»0 

= (i), 

and the Incomplete F-Function Ratio by 

\ ^ e'~^ dxjV {p) (ii). 

Ju 

The Ratio is the probability integral of the frequency curve 

2/ “ (iiO- 

This curve has for its characteristics 

Mode = = p — 1, Mean — w—p, and Standard Deviation = s/p , . .(iv). 

In the Tables of the Incomplete V-Function^ the notation is somewhat different. 


Therein 


f M 

/(w, p)= e’^^ajPdxjV {p’^V), 

JO 

or I ^ ^ lo” (j>) ■= I' (a^o. P) (v). 

Let us transfer (iii) to its mode and then integrate it up to a distance x' beyond 

the mode, where actually x' may be positive or necfative. 

i'(^o,p)r(p) 

/ J>— 1 fXQ* 

e''^xP~''^dx + / e" (^ — 1 -f dx' 

- 1 - l) r (p) + (p - 1 )^^/'”' 


Now let us take 
Then 

Tf — T ( P ^ 


®7(P - 1) = < (vi). 


r(...p)=7(?'. .p-i) 


•J2tr e- 
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Evaluation of Incmnplete EvleHan Integrals 


where 


t 1 \ '»<» — ( P - 1 ) 

fe = «!o/(p-l)= 

p _ «“**’“*’ (p — l)P~i 


.(viii). 


and is, for p = 50 and upwards, only slightly less than unity ; 

Tr,= {e-*(l + 0}^-*. 

1 Mr r __ 1 \ f j . 1 /■* . _.\1 y_ 1 \ / f* 


logTT, = (p-l){-i + log(l + t)} -(p-l)(-^ + g- j + ...). 

/ HZ'*’ ^ 

«e-4(p-i)«^(l4.y+ + + + (x), 


where 




The part of Wt independent of T will give us 

PoV p^ r%-4o.-i)«., 

^/ 27r J 0 

Taking f® = (p — 1) this becomes 

ffo 1 ,4, 


where 


V p — 1 


(xiii). 


This can be found at once from the Tables of the Normal Probability Integral 

“■Poi«f„ (xiv), 

where is always less than (or equal to) 

We now turn to the remainder of Wf. This gives 

Wt = e-i «* (r + | T’* + ^ T» + T* + . . .) 

= e-i'i’-i'‘*j|(p-l)«*i(p-l)t* + i(p-l)<»-(l-±(p-l))(p-l)<« 

- (re - S tVj- - *>*) 


/ 1 3349 . , , 

Vl2 30240^^ 


17280 ' 
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and we need 


C-(P-3J (p - 


faro' 

Jo 


g— ip—i) f « _ 


(p-iF (*• 


where 


r(p) Jo Tip) 

»re <o = ®o7(/>-1) = (®b-(1>- 1))/(P- !)• 

We will now change our variable to e, where 


Wrdt ,,.(xvi), 




with a?o' representing the distance from the mode to the boundary of our original 
F-fu notion. 

Since t = > wo find \/ p ^ i 


p-i 


and thus we have 


L e-^^Hp-\)P-^ , 


Wpdz 




Substituting z for t in (xv), we find 

r (ico , />) = f ° e-® dxjV ( p) 
J 0 


where 


- l) (-7071, 0678^0, 1) 

+ 01.5/ (-6324^553^0. I'S) + a,/ (-5773,5027^0. 2) 

+ a,.*/ (-5345, 2248^0, 2-5) + a* /( 5000,0000 «o, 3) 

+ 036^ (-471 4,0452^^0, 3-5) + a*/ (-4472,1 360 «o, 4) 

+ a4.6i' (-4264,0143^0, 4 5) + a, 7 (-4082, 4828 «o, 3) 

+ as.6 7 (-3922,3227^0. 5-5) + a« 7 (-3779,6447 «o, 6) ...(xviii), 

fo == -7=..^ , ^0 “ t 1 as m (xvii), 

\p — 1 jp — 1 


Po = V27r {p -T) (p-iy-^ e-^p-^^fr {p\ 

and equals by Stirling’s Theorem 


fi+I .. 

V 12(p-l) 288(p-l)* 


V 12(p-l) ■ 288(p-l)* 51840 (p-1) 

which will generally be adequate for computing P©. 

I^astly we have 

^ •OARn/9 1 Rorvo "^0 _ r\r\/\t 


1 

-iy+-) ’ 


ax = -26696,16203 -,r ? - , 

vp — 1 


ai.5=- -37500,00000 


(p-1)’ 
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Evaluation of Incomplete Eulerian Integrals 


at = -63830,76486 «*•» = -41666,66667 ^ ^ j) . 

12 1 


a, = - 1-59676.91216 (l - l-^) 

03 6 = 5140625 f 1 - ^ T-- ? 

V 47 /) -ly (p-1)*’ 


P 

171 

P 

1377 1 

175 


at = -9466,40961 ( 1 - ~ + 18 

\ 10»-1 (ji> - 1)V (p - 1)‘® 


tt*.# = - 6-5625 (^1 
at = 32-9792,2852 ^1 - 
as s = 2-67361,11111 - 

a* = - 28-3692,2883 (l - 


(p - 1)V (p - 1)‘ 

JL ■ 36 1 \ Po 

p — 1^ 5 {p — lf){p — 1)*’ 


36 


1 ^ 1440 


1 Po_ 

-1)V (p-'l)*®’ 


7 (p-1) 341 (p 


4437 


30141 

80 (p-1) ' 140 (p-1)® (p 


+ 


162 \ P„ 

)-!)»/ (p-1)®’ 


15759 2493 

+ , 


648 \ Po 

700 (p - 1) 3'5 (p - 1)® la (p~- I )*; (p - 1 )® ® 

(xix). 


Illustration I. Let us now test the above results on a sample case ; we will 
take the worst possible case, namely p = 50 on the border of the F-Function 
Tables, and find from those tables the value of the required r-function ratio by 
which to test our expansion. Let us take oco considerable, =2v'p — 1, i.e. = 14. 
Accordingly 

?b= 7 ^^- = 2, and Zo = ^-£~=% 

Vp — 1 p — 1 

also. Further, = (p — 1) + = 63, and the actual value 

= P (63, 50) = I' , p - 1^ = 7 (-8909,54543, 49) = -959,5825 

by interpolation from the Incomplete V^Function Tables, 

Again, 1 area up to the mode, 

= / 49 ) = I (6-9296,4646, 49), 


and from the same tables, 


= *462,1044. 


Calculating Po, we find for its value, whether from the full value or Stirling's 
approximation, *9983,00788. The normal term is 


Po X P ™ = Po X -477,: 

J 0 v27r 


2499, 
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since fo ■“ 2. Thus this term 
we find 


•476,4390. Calculating the a/s and 7'^^"=^ 




I. 

X 

> 

«1 

+ '0379,2994 

/(1-4142,1366, 1) ==*693,9947 

+ *022,5302 


~ -0076,4006 

2(1-2649,1106, 1-6) ^*460,681 6 

- -003,4426 

02 

+ -0018,6779 

/(I -1647,0064, 2) = -323,3226 

+ -000,6007 

02*6 

+ -0079,6922 

7(1*0690,4497, 2*5) « *220,2225 

+ •001,7560 

O 3 

- -0044,8199 

/(I • 0000 , 0000 , 3) =-142,8766 

- -OCX), 6404 

^3* ft 

+ -0020,8171 

/(0-9428,090&, 3-6)= -088,6874 

+ -000,1844 

04 

+ *CK)18,1243 

/(0-8944,2719, 4) =-062,6630 

+ *000,0954 

«4-r> 

- -0022,9861 

/ (0-8528,0287, 4-6)= -030,0831 

- *000,0691 

^ft 

+ •0017,6674 

/(0-8164,9668, 6 ) =-016,6637 

+ -000,0291 


~ -0000,4846 

/(0-7844,6464, 6 - 6 ) = -008,8087 

- -000,0004 

Ort 

- -0009,6016 

/(0-7559,2895, 6 ) = -004,6338 

- -000,0045 


These include all the terms up to the order 




We will now sum these terms. The sum of the 1 » P “ normal 

term is *938,5434. Adding in the ai term, we have *961,0736, whence we proceed 
as follows : 


Terms 

included 

Approximate 

value 

Size and place 
of error 

Terms 

included 

Approximate 

value 

Size and place 
of error 

Up to Ui 

,, 02 

,7 

77 ®:<-ft 

•961,0736 

•957,6311 

•958,2318 

•959,9868 

•959,3464 

•959,5308 

+ 1 in 3rd 

— 2 in 3rd 
™ 1 in 3rd 
+ 4 in 4th 

— 2 in 4th 

— 6 ill 6 th 

Up to 04 

77 04.5 

77 

77 ^ 6 -fi 

77 O0 

•959,6262 

•959,5571 

•959,6862 

•959,5868 

•969,6813 

+ 4 in 6 th 

- 2 in 6 th 
+ 4 in 6 th 
+ 3 in 6 th 

— 1 in 6 th 

True value 

•959,5825 

0 in 7 th 


Thus with 13 terms of the expansion (xviii), we have only an error of a unit 
in the sixth decimal place. Accordingly if a table* could be computed of Po and 

the I ~ together with the first eleven a*s, all values of the Incomplete 


F-function could be thrown back on the existing tables for values of p > 50, with 
an accuracy of the order *000,001, or thereabouts. 

Of course one fully admits that this expansion of high order F-fu notions in 
terms of low order F-functions suffers from the same defects as expansions 
in incomplete normal moment functions, or in tetrachoric functions. But such 
a table as we have spoken of would probably provide all the accuracy needful 
in the majority of statistical problems, and would be leas laborious than the use 
of quadrature formulae. 


* It might be needful to form the table for every unit value of p from 50 to 100, but later it would 
probably be adequate to take the intervals at several units distance. 
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The table above indicates that with every additional a« term included the 
approximation becomes closer. Further it indicates that for examples such as 
our first illustration, we shall have accuracy to the third decimal place if we 
include terms up to aa. 5 , to the fourth decimal place if we proceed to 03 . 5 » while 
up to ag we have accuracy to the fifth decimal place. 


Illustration II, We must work another example to illustrate the process, 
namely p ** 101, and we will take our xq « 120, i.e,^p — 1 + 2 Vp — 1 , so that we 
are finding 


/•lao 

c“*a^«®da?/r(101). 
J 0 


By choosing ^ 0 ' =* 20 = 2 Vp — 1, our entire series of / { — , s) remain the 

Ws + 1 / 

same and need not be recalculated, all the a/s, however, are changed as well as Po • 
We find Po ■“ *9991>fi7()2, and since remains the same, the normal term 

contribution is 


•9991,6702 X *477,2499 = 

rp—i 

Further* / dxJT {p) = 

J 0 

Together, these give us *950,2926. 

We will now form a table as before : 


= *476,8524. 
•473,4402. 



Value of 

Value of 

a, I, 

Value up to 

Fxact value less 
series value 
up to Ug 

«1 

+ *0265,73998 

•593,9947 

+ *015,7848 

•906,0774 

- -OCX), 7453 

«!•/» 

- *0037,46876 

•450,5816 

- *001,6883 

•964,3891 

+ *000,9430 

«2 

+ *0006,37776 

*323,3226 

+ *000,2062 

•964,5953 

+ *000,7368 

GTar, 

+ *0040,38300 

•220,2225 

+ *000,8893 

•965,4846 

- *000,1625 

^3 

- *0015,67107 

*142,8765 

- *000,2239 

*965,2607 

+ -000,0714 

<*8-6 

+ *0005,07077 

•088,5874 

+ *000,0449 

*965, .3056 

+ *000,0265 


+ *0007,84984 

*062,6530 

+ *000,0412 

•965,3468 

- -OK), 01 47 


- *0006,04581 

•030,0831 

- *000,0182 

*966,3286 

+ *000,0035 


+ •0003,12710 

*016,5637 

+ *000,0052 

•965,3338 

-•0(X),0017 


-•0001,19509 

•008,8087 

- 000,0011 

•965,3327 

-*000,0006 


-•0002,21647 

•004,5338 

-•(XX), 0010 

*965,3317 

+ *000,0004 


The exact value was found by quadratures from aJb = 120 to the end of the 
curve to be *034,6679, and 1 — *034,6679 = *965,3321. It will be seen that our final 
result, using up to the Og term, is only four units out in the seventh decimal place. 
Up to a8.6 we are only out two units in the fifth decimal place, and up to ag.g only 
between one and two units in the fourth decimal place. 

It will be clear to the reader that to replace an incomplete F-function by a 
normal curve will not give a correct result even in the second decimal place, when 
p a« 101, if we want an area reaching to about twice the standard deviation from 


Found by quadrature from 45 ordinates at unit distance apart. 
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the mode. It would work better, of course, if we knew the area up to the mode, and 
were only seeking areas up to ± v'jJ on either side of it. But clearly the safest way 

to reach high order incomplete F-functions is to form a table of and 

« , - . . Jo r (p) 

of the functions of p, i.e. Po, Ois ••• oq, and so throw such incomplete F-functions 
back on the already published table. 

Will any trained computer volunteer for the work? 

II. The Incomplete 'B-Function, 

Let the required integral be the B-function ratio 

^ ^ Jo B(p + i, (/+i) (0. 

where B (p + 1, ? + 1) = r(p + 1) FC? + l)/r (p + y + 2>. 

We will suppose q >p, which it is always feasible to arrange, if needful, by 
aid of the transformation a; « 1 — on , and finding the area from the other end of 
the curve. 


We have 

J + 1 a + 1) r*(p + y + 2) 

^ F(p + l)F(y + l) 

_ F(p+</ + 2 ) 

F(p + l)F(3-i-l) 

_ F(p + 7 + 2) 
r(p + i)r(y + i) 

We may now expand the exponential 


p a;Pe'i log 
Jo 

f “ aji' e - 9 + i da, 

Jo 

r o 

^4 dsc. 


a*® i 

? 2-9 3-9 


as equal to 

and our integral becomes 


1 + C2ic»+ c%a^-h... +Csa^ 




r(jt> + y + 2) J po ojP e-9* due 

r(g-fi) (Jo r (p + i ) 

r^o aiP^^e-^dw 

■r(p + i)- + - 


F(p + 1) 


F (p + y + 2) 1 7' (^0, p -H ) J' {zq, p + 8) 


r(y + i) 


3 'ip + 2) fp+1) 


I' izoi p d" 4) , .. V . 

^^(p + 3)(p + 2>(p4-I)... 


r iza,p + 8+l) 


where we have taken z^qx and Zo = qooo . 


(p + s)(p + s — 1) ... (p + 1) + ...j- 
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If we now use the notation of the Tables of the Incomplete T -Function we have, 
on putting p — 1 for p, and q^l for q. 


(jo. 9 ) = ~ ^ P ’ 


= r' ( i* + g) 

r(9) 


p_ 

iq-iy 


(X^{q-\) \ {p + l)p 

^ I Vp + 2 )(.q- 1)"+* 


. 0./ .) -..) (>') 

Thus if p < 50 and q > 50 , we have reduced our integral to a series of in- 
complete F-function ratios, which can be taken out of the above-mentioned tables. 
Wo have to determine the c’s and to consider the convergency of the series. 

Turning back to (ii) and taking logarithmic ditferentials, we have 

(— qx — qx^ — qa^ — ... — qaf — ...)(!+ C 2 X^ + c^x^ -h . . . -h Cf^af 4- • . . ) 

*= 2 c^x -f 3 c 3 (r* + . . . 4* 4 * 

Hence we easily find c?2 J C3 == — J 5^, and generally 

(V). 


« — 1 q 

' Gg_i - Ca_2 

s s 

To suit the notation of (iv), we must, however, put — 1 for q^ and we then 
have 

g— 1 7 — 1 1/ 2\ 

C2=-''-2-, C3 = -'3-. C4 = g(9-l)*(l- — j), 

^ ” 24^^“ 7 C^i)*)’ 

_ 1 . 68 696 48 \ 

“ 384 ^5- 1 J 1 3 (^ _ 1 ji + 10 (3 - 1 )* W-if) ’ 

“•-n. <«-’>*(» 


- i)V ’ 


472 


892 


^ 16 \ 
)* ((/-i)»r 


45(g-l) ■ 35(?-l)« (9-l)» 

1 , 1 ^6 /'i 140 964 23088 384 

3840 ^ 3(2-l)‘^3(g-i)* 35 - 1)» 1)V 

916 3716 6704 1152 


4). 


” 4^0 ~ (^ “ 3 


45 (^f - 1 ) 35 (3 - 1 )» 35 (3 - 1 )» ^ 1] (3 _ 1) 


i‘)’ 


28828 


478,024 


155,552 


, 552 _ 3840 \ 

-1)* (3-l)V 


(3 -1)^27 (3-1)* 105 (3-1)*^ 21 (3-1)* (3-l)‘ 

(vi). 
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Now let us write (iv) in the form 

(P. ?) = 0^1 («o (9 - l)/yp, p-i) + a,7 (xa {q - l)/v'p + 2 , p + 1 ) 

+ 0*7 (xoiq — iyi'^p + 3, p + 2) + ... 4 - o,7 (xo(q — 1)/ Vp +«, p + s — l) + ... 

(vii). 

Then we have 


Oo 

a# 


a4 


^5 


^6 


r (p + 9) 1 „ = _ r (p -f 9) (p + i)p 

r(9) (9-1)'” * r(9) 2(9-1 )*h-i* 

_ r(p-i-g') (p + 2 )(p + i)p 
r( 5 ) 3(j-i)p+» ’ 

4 . r * (p + g) (p + 3) (p + 2)(p+l) p _ 2 \ 

r(g) " 8 («-l)P+«“ V 9 - 1 /’ 

. r(P + 9 ) (p-i-4)(p +3)(p + 2 )(p + l)p / _ c \ 
r(g) V 5(q-l)J’ 

_ r’jp + g) (PJ!: 5) (p + 4) (p-» -3) (p + 2 )(p-<- l )p / _ 26 8 \ 

r( 9 ) i8(q-i)p^ r 


Oj 


oa 

«* 


aio 


f (P + g) (P + 6 ) (P + 5) ••• ( P + 1) P A _ 22 24 \ 

r(9) 24(3-1)»m-« V 6 (3 -1)'^ 7(9- 1)V’ 

r(p + 9 ) (p+7)(p + 6 ) ... (p + l)p _ 68 696 48 \ 

r( 9 ) 384 ( 9 V 3 ( 9 - 1 ) ■^ 10(9 -1)* ( 9 -!)*/’ 

r(p + 9 )(p + 8 )(p + 7)...(p + l)p/ 472 892 16 \ 

r( 9 ) 144(9-1)p+» V 46(9-i)‘^36(9-l)» (9_1)»J’ 

r(P + g) (p + 9)(p + 8 )...(p + l)p 
r( 9 ) 3840 ( 9 -1)*^® 

/ _ 140 964 2.3088 384 \ 

3 (9 - 1) + 3 (9 - 1)* 36 (9 - 1)» (9 - i)*; ’ 


(711 = 


ail = 


r( p + 9) (Pjt- 10) (p + 9) (p + l)p 
r ( 9 ) ii52 (9 - i)p+» 

/ 916 3716 0704 1152 \ 

^ V 46 (9 - 1 ) 35 (9 - 1 )* 35 (9 - 1 )» 11 (9 - 1 )V ’ 

r (p + g) (p + 11 ) (p + 10 ) ... (p+ i)p 

r( 9 ) 46080 ( 9 - l)P+« 

/ 250 28828 478,024 166,652 3840 \ 

3 ( 9 -!)■*■ 27 ( 9 -I)* 106(9-1)*"'’ 21 ( 9 -l)« (9 - 1)®/ 

(viii). 


Illustration (i). As a first example, we will find the incomplete B-function 
ratio 


7 . 014 ( 4 , 101 ) 


or, if (1 — a?). 



(4, 101), 

(1 (4, 101). 


Biometrika xxvn 


27 
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This integral is fairly easy to find by the expansion of (1 — a;')* and direct in- 
tegration, using many figure logarithms to determine (*976/®^, which equals 
•0859,8675,99. Thus we obtain 

7.024(4, 101) = -2402,41262. 

We now proceed to compute our a*s from (viii) and determine the /'s by 
interpolation. The results are given in the table below : 




, Values from 

r-Tables 

a,xl. 

Value of S 
up to a. 

«0 

^3 

a4 

«« 1 
(27 

H- 1*1035,5024 

- *1103,55024 

- *0044,14201 
-1- *0113,56532 
+ *0012,21145 

- *0012,71078 

- *0002,66953 

/o(l*2, S) -- *2212,7710 

72 (-9797,95896, 5) =- *0356,7257 

7, (-9071,14735, 6) =:*0116,9372 

7* (*8485,28136, 7) =*0033,3734 

/ft^8, 8) =-0008,6210 

7o (-7589,46648, 9) =*0002,0019 

It {'7230,27229, 70) = *0000,4249 

•2441,90397 

- *0039,36647 
-*0000,61180 
-h *0000,37897 

4- *0000,01053 

- *0000,00264 
+ •0000,00011 

*2441,90397 

•2402,63750 

*2402,02570 

•2402,40467 

1 -2402,41620 

1 *2402,41266 

•2402,41277 

True value 

= •2402,41262 


Now since our 'Table of the Incomplete T~Function Ratios is only computed to 
seven decimal places, we cannot trust the last column to more than seven decimals. 
It is thus not worth while including ag. At a® we are correct to seven decimals. 
Thus when q is large compared to jt>, six terms are adequate to obtain seven-figure 
accuracy. When four-figure accuracy is adequate, a^ will suffice. To obtain ade- 
quate values for the 7/s we used interpolation for og and ag, but found that 
with ttg the inclusion of although modifying /g in the seventh decimal place, 
gave no change in the value of Og/g up to the ninth decimal place. It is accordingly 
sufficient to interpolate the 7, values, using only 8*, except in the case of the 
values 5=2, or perhaps 5 = 3, as p increases relative to q. 

It would thus seem that the method is satisfactory if p be as low as 4. Further, 
it can be a good deal shortened, if we do not use formulae (viii), but proceed in 
the following manner. We have seen by (v) that 


and further, by (iv), that 


- 1 q 


.(V). 


„ + ...{p + l)p Tip-^q) 

" * (?-!)• r(s)(g-l)»> 

-o.x/._oc. 

wftere O - , and /. _ c. 


.(ix). 


Hence in (v), replacing c, by a,, we find 

ly “ 1) “*-1 - (p + « — 2) a,_,} (x), 

or, again, f, =. {(« “ - (p + « - 2)/_,} (x hid). 
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The latter formula has some advantages if we wish to deal with more moderate 
numbers, as G may be large. We then multiply by G at the end of our desired 
series of //s. 

Now the series of a,*s or //s can be most rapidly ascertained in succession by 
the machine from (x) or (x his). For example, Oo «= O, oi « 0, or « 1, /i » 0, and 
for p = 10, = 101, say, we have the series 

~ iWtr (®1 ““ 19ctoX «8 = (2ctg — llai), 12^2), 

«« = (4^4 - 13a8), - 14a4), etc. 

The rapidity of this method is great, but it h€U3 the disadvantage that, if a slip be 
made in one a,, it is carried on to later a’s. Hence it is advisable to check the 
last a by means of the corresponding value of a in (viii). Thus there is no 
difficulty in finding the a’s to the s needed to give the number of places necessary 
in the required incomplete B-function. In the example worked out above, we 
obtain seven decimal-place accuracy for 5 = 6. 

Actually at the very best we cannot hope for greater accuracy, for the Tables 
of the Incomplete T-Function only reach to seven decimals, and accordingly, as G 
grows greater than unity, we shall not be correct in the sixth or seventh figure 
when we multiply by G, We need a F-function table to some 10 to 12 figures, 
if G lies between 300 and 400. We will illustrate these points on a couple of 
examples. 

Illustration (ii). Let us take p = 10, q = 101 and ccq = *075. Here 


and the general form for 


a = 1-7018,21 43,78, 


and further 


L 


A, 

V vp 4* 5 / \vp + 8 

/* = {(» - - (8 + «)/.-»} • 



Working the /’s out in succession from this, we obtained the values given in the 
second column of the table below. 


The true value of the function found by “brute force ” integration and multiplying 
out was *3109,71212, and checked by quadrature -3109,7120. The value -3109,712 
will be correct to seven places and serve for comparison. 


Table for p = 10, q = 101 and ccq = *075. 



■f. 


. From F-Func- 

• tion Tables 


nptoa. 

/o 

1 

•/» 

1 

p 

1-0000,0000 

- *55 
-•044 
+ -21021 

+ •0395,6962 
--0686,2866,00 
-•0229,7618,08 
+ ■0199,1616,23 
+ •0109,9814,68 

- -0049,3064,11 

- -0046,9683,97 
+ •0008,2177,63 

1*7018,2143,78 
~ -9360,0179,08 
-•0748,8014,33 
+ •3677,3988,44 
+ -0673,2323,92 
--1167,9366,47 
- -0390,9966,62 
+ *0338,9373,49 
+ •0187,1688,03 
-•0083,9107,07 
-•0079,9148,07 
+ •0013,9851,48 

/o (2-371 7,0824,6, 9) = -2235,9236 
72(2*1650,6350,9, 77)== *0792,4131 
/s (2*0801,2673,6, 7;s*) = *0426,668:1 
/4 (2-0044,6931,4, 7.^) « *0215,6463 
4 (1 *9364,9167,3, U) = *0102,6042 
/fl (1 *876, 15) = *0046,0831 

/7 (1*8190,1718,8, 76*) ^*0019,5890 
4(1*7677,6695,3, 77) « *0007,9001 
4(1*7206,1800,4, 76’) -*0003,0301 
7,0^-6770,5098,3, 7£>)» *0001,1074 
/ii ( 1 -6366,341 7,7, ^0) = *0000,3864 
/,a (1-6990,0537,3, ^7) =.-0000,1288 

+ *3805,1427 

- *0741,7001 

- •(K)31,9482 
+ •0077,1453 
+ *0006,9076 

- *0005,3822 

- *0000,7659 
+ *0000,2678 
+ *0000,0567 

- -0000,0093 

- *0000,0031 
+ -0000,0002 

+ *3805,1427 
+ *3063,4426 
+ -.3031, 4944 
+ •3108,6397 
+ •3115,5473 
+ •3110,1661 
+ •3109,8992 
+ -3109,6870 
+ *3109,7237 
+ *3109,7144 
+ •3109,7113 
+ •3109,7115 


27—2 
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The a's are correct to their ten decimals and there is no difficulty in tabling 
more, in fact ois, au, and dxt were found. But to make use of these a/s it 
would be needful to have the //s to more than seven figures. The J,’s found by 
interpolation have beeu used to eight figures, but the last figure is, of course, 
unreliable. The value of 0 being > 1, the eighth and ninth figures of I for the 
lower values of 8 would be needful to obtain the value of the incomplete B-fiinction 
exact to the seventh decimal place. Thus the value obtained from the P-Function 
Tables is *3109,7116 or six units in error in the eighth figure. This is what we 
might have anticipated. On the basis of this example we can assert that for 
p = 10 the true value can be found to some six units in the eighth decimal* 

Application of Dr Muller's Method. 

In order to test the labour required in working various processes for finding 
the incomplete B-function, we worked our /.075 (10, 101) by Miiller’s Continued 
Fraction Method*. The results are given in the table below. It may be noted 
that when two successive convergents agree to the number of decimal places we 
require, we may stop our computing, and take the required value as that common 
to the two convergents. 

We shall use practically Dr Mttller’s notation*. The true value of 

/.075 (10, 101) = •3109,71211, 

= + l = j»o = l- % = -925, O = = •1086.2676,9752, 

.... „ r(p + i)r(9) 

_ 101 - r ^ Xo ^ 

^ ToTr * ^""^""37* + ^r-2 • 


r 

br 

Pr 

Vr 

C*p,./q,. = con- 
vergent to 

/.ovfidO, 101) 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 I 

11 

12 

13 

14 

1-0000,0000,00 

- -7371,0073,71 
+ -0081,8181,82 

- -6660,0831,60 
+ -0997,9209,98 

- -4540,5405,41 
+ -1146,2702,70 

- -3768,9427,66 
+ -1208,2670,91 

- -3186,3442,39 
+ -1226,8847,80 

- -2760,9662,61 
+ -1221,4812,21 

- -2409,9989,32 

1 -0000,0000,00 
1-0000,0000,00 
1-0681,8181,82 
-5021,7360,22 
-6087,6960,88 
-3807,6569,43 
-4604,7626,77 
-3073,5238,14 
-3617,8194,64 
-2638,4889,74 
-3082,3537,38 
-2366,6146,90 
-2733,0183,11 
-2166,0985,46 

1*0000,0000,00 

•2628,9926,29 

•3310,8108,11 

•1822,7791,20 

•2153,1718,83 

*1325,5316,34 

•1672,1280,08 

•1073,8682,63 

•1263,8233,06 

•0921,6619,14 

•1076,7084,72 

•0823,1652,33 

•0954,6831,51 

•0766,3004,18 

•3112,58264 

•3109,01233 

•3109,64894 

•3109,74816 

•3109,71947 

•3109,71063 

•3109,71186 

•3109,71217 


It will be seen that convergents 7 and 8 give three-figure accuracy, 9 and 10 
four-figure accuracy, 10 and 11 five-figure accuracy, 12 and 13 six-figure accuracy. 


BimMtrika, Vol. xxn. pp. 291 et seq. See also our first footnote, p. 428 below, 
t Value obtained by brute force,’» by quadrature and by the r-Punotion Table. 
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and 13 and 14 the required seven-figure accuracy. Muller’s method is certainly 
shorter than the " brute force ” or quadrature methods, and about on a par with 
the F-Function Table method at p « 10. 

Illustration (iii). A still more extreme case was now taken, namely, 
p = 36, 5 = 101 and -20. 

Here 0=392*0702,0930, and it is clear that with a seven-figure table only 
of the incomplete F-function we cannot hope after multiplying by such a 0 to 
reach more than about four-figure accuracy for an incomplete B-fu action. 

We found the following values : 


Table for p = 36, q = 101 and xq = *20. 



f. 


j Valaea from 

« r-Tables 


S (a^4) up to 

/» 

y 

/» 

y,« 

1 

1*0000,0000 
— 6*66 

- 1*6872 

+ 24*181,794 
+ 13*0022,3808 

- 61*6544,7226 

- 54*1811,1669 
+ 118*7993,2634 
+ 160*3644,8196 
-170*2750,5096 

- 372*9827,2825 
+ 146*0854,9140 
+ 711*9955,9844 

392*0702,0930 

- 2611*1876,9394 

- 661*5008,5713 

+ 9480*9610,3483 

+ 5097*7902,0539 

- 24172*8818,4326 

- 21242*801.3,8869 
+ 46577*6767,4282 
+ 62874*1360,0242 

- 66759*7748,6846 

- 146235*4163,3026 
+ 57275*7691,8889 
+ 279152*26.33,0105 

/o (3*3333, .3.333, 3, .9,7) « *0008,0363 
(3*2444,2842,1, . 97 ) = *0002, 1708 
(.3*2025,6307,7, .9.9) « *0001,0868 
4(3*1622,7766,0, .9.9) « *0000,5323 

A (3*12.34,7623,9, = *0000,2547 

4(3*0860,6699,9, .^i)* *0000,1192 
/7 (3*0499,7140,9, 4^^) = *0000,0541 
4(3*0151,1344,5, 4^)== *0000,0239 
4(2*9814,2396,9, .^.^) = *0000,0102 
/lo (2*9485, .3912, 2, ^5) « *0000,0045 
/„ (2*9172,9982,6, 46) « *0000,0022 
43(2*8867,5134,6, .^7)= *0000,00105 
43(2*8571,4285,7, 4^) = *0000,00050 

+ •3150,79.38 

- *6668,3660 
-•0718,9191 
+ •5046,7166 
+ *1298,4072 
-*2881,4076 

- *1149,2.356 
+ *1113,2065 
+ *0641,3162 

- *0.300,4190 
-*0321,7179 
+ 0060,1396 
+ •0139,6761 

+ *3150,79.38 

- *2617,5722 

- *32.36,4913 
+ •1810,2243 
+ *3108,6315 
+ *0227,2240 
-*0922,0116 
+ •0191,1949 
+ *0832,6111 
+ *0532,0921 
+ *0210,3742 
+ *0270,5138 
+ *0410,0899 


The true value is *0409,4983, or the excess at this point is roughly six units in 
the fifth figure. But even this agreement is illusory. It depends on taking the 
value of Ig to eight or nine figures in the last two cases, where our tables only give 
seven figures! Had we taken /i 2 = *0000,00104 instead of *0000,00105 we should 
have been only two units in error in the sixth figure instead of six in the fifth 
figure. But the reader will ask how we found Jia and /is. If he will turn up p = 47 
and the argument lying between 2*8 and 2*9 in the Incomplete V-Function Tables^ 
he will find that he has to interpolate between 

i/o = ‘0000,000, 3* Uo = *0000,000, 3* ?to = ‘0000,000, and 

wi= *0000,001, = *0000,002, 3^= 0000,001, 

and matters are still worse for /is. These units in the last figure may be anything 
from just over 05 to just under 16, and when we have to multiply by over 392, 
what they are is essential. Actually interpolating from the Tablet the value found 
for /ii is *0000,00080, which can only be taken as *0000,001, but in multiplying 
by over 392, it is essential to know whether 80 or 100 is the better value to adopt. 
Actually, when the a,’s are as large as they are in this example, a table of the 
incomplete F-function ratio would have to exist to at least ten decimal places. 
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It is not the large number of a# s required which really distresses us — they are not 
hard to run off on the machine. The difficulty is that the /,*s cannot be obtained 
to an adequate number of places. How then did we select to be *0000,00105 
and /i 3 to be *0000,00050? Simply by looking at the column and noting that 
7,4.1 is always somewhat less than half /,, the preceding value. A round number 
under half 22 is 105, and a round number under half 105 is 50 ! It is clear that 
such a process is by no means to be commended, and taken together with the 
erratic values of the a/s, so obvious in the products, a,/,, we must conclude that 
the reduction of an incomplete B-fu notion to the Tables of the Incomplete T -Function, 
while satisfactory forp = 10 and gr= 101, and probably some way beyond this value 
of p, fails to be a practically useful method when p gets to 36, or some third or 
more of We need, therefore, a more satisfactory method when p and q approach 
more nearly to equality, and one or both lie outside the B-Function Table, 

Various methods have been applied, such as expanding the incomplete B- 
function in terms of incomplete B-functions lying inside the published Incomplete 
^-Function Table, but none of these seemed for p and q of the order of 36 and 101 
as short as Dr Muller’s continued fraction method. That method has peculiar 
advantages, when we only need the incomplete B-function ratio to four or five 
decimal places; it converges steadily from the early C while the (a,/,) 

may be most erratic before it approaches the true value. 

In all the methods there are many chances of even a good computer making 
a slip. But systematic arrangement of the material and experience in the use 
of the methods makes them far shorter than appears at first sight. The reader 
must also remember that it is solely the mathematician who, for special purposes, 
will aim at seven-figure accuracy. For the statistician, four- or even three-figure 
accuracy is often adequate. 

We give in the table below the data requisite to find /.g (36, 101) to seven- 
figure accuracy by continued fractions. 


7.8(36, 101) « •0409,4983 (by quadrature). 
p = 36, q = 101 , k ^ p q \ ss 136, scq = *2, yo =*■* 1 — ^0 = ‘8, 


(7 = 


Arl) 

r{p + i)rXq) 


= *0147,53914, 


101 -r 
36 +r ’ 


Vo 4’ 


Pr — 2>r~l + brPrSii Wqr-~2*» 

Thus the eighth and ninth convergents give the value correct to four figures, 
the eleventh and twelfth correct to five figures, the thirteenth and fourteenth to 
seven figures as required. 


* The general expressions for the ^*s are 

2, _ (p4-r- 1) _ r(k + r) 

^(p + 2r~2) (/> + 2r-l) (l) + 2r- 1) <j> + 2r) * 

p,=:gi = l, and Pa = l- 


while 
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r 

K 

Pr 

9r 

Continued 
fraction p^lqr 

converging to 

J.a(86, 101) 

1 

-f-l 0000,0000,00 

1-0000,0000,00 

1-0000,0000,00 



2 

- -6756,7667,67 

1*0000,0000,00 

•3243,2432,43 

— 



3 

+ -0243,6988,62 

1-0243,6988,62 

•3486,8421,06 

— 



4 

- -6178,1407,98 

•4064,4490,64 

•1482,7935,22 

__ 

— 

5 

+ *0442,3076,92 

-4617,6313,21 

•1637,0192,30 

— 

— 

6 

- -5676,8292,68 

•2210,2129,81 

•0795,2626,64 

2-7792,2386,45 

•0410,0443 

7 

+ -0606,4006,97 

•2483,7046,42 

•0894,3679,22 

2-7770,5022,83 

•0409,7236 

8 

~ *5236,7109,63 

•1326,2799,87 

•0477,9118,61 

2-7751,6609,67 

*0409,4441 

9 

+ *0739,9577,17 

•1610,0636,29 

•0644,0912,96 

2-7753,8648,40 

•0409,4789 

10 

- *4848,4848,48 

•0867,0187,87 

•0312,3764,69 

2-7765, 5738, 28 

•0409,5033 

11 

+ -0861,4492,76 

•0996,5930,46 

•0368,7030,73 

2-7755,3631,02 

*0409,5001 

12 

- -4503,9316,45 

•0605,0937,19 

•0218,0108,64 

2-7755, 2807, 11 

•0409,6127 

13 

4- -0944,1489,36 

•0699,0926,30 

•0261,8777,66 

2-7755,2298,92 

•0409,4983 

14 

- -4196,4285,71 

•0455,1692,73 

•0160,3910,68 1 

2-7755,2408,97 

•0409,4984 


Conclusions, Methods have been indicated of throwing back values of the 
incomplete F-function ratio and the incomplete B-function ratio lying outside 
the ranges of the published tables on the Tables of the Incomplete T-Function, 
These methods work adequately for the F-function and also adequately for the 
B-function, if p be small as compared with q. But in the latter case, if p be com- 
parable with q, the constant C =» p becomes very large, and the existing 

tables of the F-function ratio only giving seven decimal places are inadequate for 
the method, which would require at least ten decimal places. After considering 
various methods for determining an incomplete B-function ratio with comparable 
p and q, we believe the most efficient to be Mtlller s method of the continued 
fraction, illustrations of which are given in the paper. 


The authors of this paper have not overlooked the value of the late Mr Soper's 
tract, Numerical Evaluations of the Incomplete H-Function*y but have endeavoured 
to make a somewhat more direct answer to the question often put : What is to be 
done when we need the numerical value of incomplete Eulerian integrals lying out- 
side the existing tables ? 


In conclusion we desire cordially to thank Miss F. N. David for kindly working 
out certain values for us, as, being away from home, we had no machine, and no 
table of more than seven-figure logarithms. 


Tracts for Computers, No. VII. Cambridge University Press. 




FURTHER INVESTIGATION OF THE MORPHOMETRIC 
CHARACTERS OF THE INDIVIDUAL BONES OF THE 

HUMAN SKULL. 

By KARL PEARSON and T. L. WOO, Ph.D. 

( 1 ) I ntroduction . 

In 1930 the then Director of the Biometric Laboratory determined that the 
time was ripe to undertake a wide system of measurements on the individual bones 
of the human skull, A series of 63 measurements was selected, which could be 
taken on the non -disjoin ted crania, and the term morphometric was introduced for 
such measurements to distinguish them from the existing ethnometric measure- 
ments, which may cover several bones at once. It was noted, however, that mor- 
phometric measurements may be found to have first-class ethnographical value 
when, in the future, they shall have been taken on a number of racial groups. It 
was proposed to start the morphometric measurements on an adequately long series 
of crania, and discuss in the first plewse the results which flowed from the measure- 
ment of homologous bones. The well-known E series of Egyptian crania of the 26th 
to the 30th dynasties, numbering about 800 male crania, were chosen, and an 
indefatigable worker, both from the measuring and computing sides. Dr T. L. Woo, 
was selected to undertake the work in 1930. After a year’s labour in measuring and 
computing, Dr Woo published, in 1931*, the first report on these morphometric 
measurements, with some assistance from the Director of the Laboratory in the 
preparation of his text. This dealt with the 50 characters out of the 63, which be- 
longed to homologous bones, i.e. 25 pairs of corresponding measurements on the 
right and left sides of the skull. The whole report dealt with absolute sizes, and 
not with the shapes of individual bones. The discussion was chiefly concerned 
with asymmetry in size. The following conclusions were reached : 

(i) The mean or type skull of a race is definitely and markedly asymmetrical ; 
asymmetry is not only a peculiarity of the individual skull; it is also a racial 
character. 

(ii) The frontal and parietal bones are with marked significance greater on the 
right side, so also are the approximately antero-posterior lengths on the temporal. 
Both malar measurements are in excess on the left, but these measurements have 
little to do with the brain. On the other hand while the arcs from asterion to 
sagittal plane ■!*, and from asterion to basion were significantly greater on the right, 
that from the lambda to the asterion along the lambdoid suture was vnarkedUy 
dominant on the left. 

* Bionietrika, Vol. xxn. pp. 824 — 852. 

t For definition of this (and other) measurements see ibid. pp. 825 — 820. 




Diagram 1 
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(iii) The variability, either absolute or relative, showed no marked lateral trend, 
and the greater lateral variability was not significantly associated with the greater 
lateral characters. 

The report showed the large part played by homology in the correlation of 
morphometric characters. It also indicated that facial and frontal homologous 
characters were more highly correlated (‘86 — *98) while those of the temporal, 
parietal and occipital regions were less so (‘64 — *86). The face, possibly owing to 
sexual selection, is the most symmetrical region of the skull. This report, however, 
left much for future consideration and the present paper will deal with many 
problems left unanswered in the first report. 



Diagrams 1 and 2, giving respectively the norma lateralis and norma basalis 
of a skull, indicate approximately the measurements taken during the whole of 
this investigation. 
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( 2 ) Oonstants of the Thirteen Additional Characters. 

We place first on record the remaining thirteen characters dealt with in 
Dr Woo's measurements ; these were principally but not entirely taken along or 
perpendicular to the median sagittal plane. We shall not repeat here the defini- 
tions of the 60 bilateral characters discussed in the first report, but we reproduce 
here the diagrams indicating those characters with addition of the thirteen further 
characters. 


List of the Thirteen Additional Characters. 


Frontal 


Parietal 


Fg, Minimum arc between the terminals of the minimum frontal di- 
ameter. {B*}. 

Fi, Minimum arc from nasion to bregma. {/Si}. 

Fg, Minimum arc from right stephanion to left stephanion. 

{Pi, Minimum arc from bregma to lambda. (/S-g). 


Occipital 


Oi, Minimum arc from lambda to opisthion. {/S^a}. 

Oa, Chord from opisthion to basion. {fml\. 

Og, Chord from basion to spheno-basion. 

O 4 , Minimum arc from right asterion to left asterion. 

Og, Chord between the points, right and left, where the occipito-mastoid 
suture meets the posterior border of the jugular foramen. 

Og, Breadth of the pars basilaris, taken between the legs of small 
calipers as far forward as possible. 


Sphenoid 


'/S»i, Chord between right and left krotaphions. 

/S 4 , Chord between the extreme posterior points on the right and left 
of the sphenoid exposed on the base of the skull. 

/S 7 , Chord between the lowest points, right and left on the suture 
^ between the medial pterygoid plate and the palate bone. 


Four of these are familiar ethnometric characters and we have placed against 
them in curled brackets the symbols used to denote them in the Biometric 
JLaboratory memoirs. They are the sole cranial characters common to the present 
morphometric and the earlier ethnometric researches of that Laboratory. 


We have tabled the constants of these thirteen additional characters in Table I, 
p. 428. 


( 3 ) Variation of the Bixty-three Characters. 

We are now able to consider the variation of our 63 characters taken as a single 
group. In Table II (p. 429), we have collected the variation constants: in column (a) 
the bone; in column (h) its character; in column (c) the absolute variation of the 
character; in column (rf) the relative variation, or percentage variation on size of 
character; in column ( 0 ) the characters of the bone in order of intensity of absolute 
variation; in column {f) the order of intensity of relative variation, arc provided. 
In the last two columns the order of absolute sizes of the characters are placed for 
each bone character in square brackets. When we examine column ( 0 ) we see that 
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TABLE I. Constants of the Non-Bilateral Characters used in this Paper. 
(This Table must be taken in conjunction with Table I of the former Memoir^.) 


Bone 

Character 

(cases) 

Means 

Standard 

Deviations 

Coefficients of 
Variation 

Frontal 

/’sCSOl) 

/’6(888) 

104-5839± -1447 
127*2100±*1413 
139-5224 ±-2181 

6*4047 ±*1023 
6*2372 ± *0999 

9 *6366 ±*1642 

6*1240± *0982 
4*9031 ± *0787 
6*9061 ±*1110 

Parietal 

A (887) 

128*3726 ±*1651 

7 -2475 ±-1167 

6*6466 ±*0912 

Occipital 

0, (864) 

O 2 (882) 

O 3 (870) 

O 4 (876) 
Or.{»74) 

0. (864) 

115-0093±-1574 
36-7330 -0592 
26-3397 ± -0634 
128-0791 + -1378 
73-2154 ±-0818 

21 -8694 ±-(>418 

6*8604±*1113 
2*6047 + *0418 

2 *3343 ± * 0:177 
6*0422 ±*0974 
3*5874 ±*0579 
1*81 99 ±*0295 

5*9661 ±*0972 
7*2894±*1177 
8*8626 ±*1444 
4*71 76 ±*0762 

4 *8998 ±*0792 
8*3256 ±*1362 

Sphenoid 

Si (732) 
5,(881) 

S, (822) 

' 114*6027 ±-1332 
70*3871 ±*0817 
35*4720 + *0687 

6*.3447±*0942 
3*5943 ±*0678 
2*9187 ±*0486 

4*0677 ±*0825 
5*1005 +*0823 
8*2281 ±*1378 


the absolute variations in the characters of each individual bone, even allowing for 
random sampling, are not proportional to the absolute size of the organs measured. 
Thus it is demonstrated that the coefficients of variation will not be constant — any 
more than the standard deviations for a single cranial bone, much less for all the 
bones, i.e. the coefficients of variation must depend on the size of the character 
measured. 

At the same time it will be clear that there is considerable positive correlation 
between the intensity of the absolute variation and the size of the varying character. 

This is shown by our Table II where the intensities of the variation are arranged 
in decreasing order, and against them are put the sizes of the corresponding organs, 
[1] marking the largest measurement, [2] the second and so on for each bone. If we 
now average the bracket numbers for the upper and lower halves of the group of 
characters for each bone, we find that for the six bones — frontal, temporal, sphenoid, 
malar, maxillary and occipital — these averages are much lower for the greater in- 
tensities of variation, only in the case of the parietal are they equal. In other 
words, the absolute variabilities of the larger-sized characters are the larger. 

The last column (/) shows, however, that the coefficient of variation overcorrects 
this defect; for if we except the frontal where equality exists, and the short malar 
series of only four characters, where the two larger coefficients of variation belong 
to the two large-sized characters, then in the case of all the remaining five bones 


JHmnetrika, Vol. xxii. pp. 830 — 331. 














TABLE II. Farta6^%, Ahaohste and BakMWt ofiihe%% Morphometric Oharaotcrei 


(«) 

Bone 

(&) 

Oharaoter 

standard 

Ckkoi^oient 

Onler o/ Absolute 

Order o/'Aelative 

Deviation 

of Variation 

Variation 

Variation 



6-10 

6*18 


[1] 

Mean First 

Fs 


Mean First 

1 

5*06 

6*16 

‘3 

Four S*S0 

Fs 

‘3 

Four 4*^5 


4*78 

5*43 


‘2 


(Fs{L) 

7‘ 

\L 

4*98 

6*69 

SFi («) 

[4' 


\Fs{R) 

’6' 


£ 

Fz 

6*40 

6*12 

\Fi (Z) 

6' 

Moan Last 

JFUP) 

'4 

Mean Last 



6*24 

4*90 

jFs (Z) 

;7; 

Four 5*50 


6* 

Four 4 'BS 



9*64 

6*91 

\F, {U) 

[6] 


Fs 

[2] 


Pi 

7-26 

5*66 

P\ 

[5] 

Mean First 

Pi 

[6] 

Mean First 


r> 

6-93 

5*27 

Pz{L) 

4’ 

Four 4'^^ 

(PUP) 

6 

Four 5*50 

1 


6-70 

5*13 

PaW 

2 


XPzW 

’7’ 


1 

P j 

5-92 

3*58 

PziP) 

6 


(PzW 

4 



6-04 

3*70 

PsiP) 

3* 

Mean Last 

\Pz{P) 

■3 

Mean Last 


p 

5*84 

3*30 

PUP) 

1' 

Four 4'^^ 

]PUL) 

’2 

Four 8’50 



6*96 

3*39 

PziL) 

7] 


\PUP) 

[1] 



T 

3*89 

4*48 

\^(p) 

[2^ 

Mean First 

iPUP) n 

L21 

Mean First 


3*83 

4*46 

1 

Seven 4^71 

1 

L1‘ 

Seven 10‘14 


T / ^ 

3*78 

8*11 

}Tz(P) 

6' 


TUP) .1 

13 



3*71 

7*93 

\TziL) 

6 


T.t(H) 

■s; 



(yr J P 

4*27 

6*45 

(TiiS) 

'3 


•J'.i.L) [ 

14^ 


'g 

4-25 

6*47 

\Ti W 

4‘ 


tul) 

(TsW 

7 



T J 

4*30 

4*31 

fTe(P) r 

L2 


6^ 


B 


4*48 

4*50 

Xnw [1 

LI’ 

Mean Last 

\Ts(B) 

5 

Mean Last 


rp \ P 

2*75 

5*98 

JTUP) 

‘S' 

Seven I(P^9 

{TUB) 

'9’ 

Seven 4*7i 



2*77 

6*01 


7 


\TUP) [ 

10 


m J 

3*79 

8*33 

J Pr (P) [ 

L3 


TtW 

? 



^«\Z 

3*78 

8*24 

iTiW 1 

14 


(TUP) 




fp 

2*94 

2*86 

8*14 

7*94 

/n(Z) [9 

\T,{R) [10] 






>S’i 

5*34 

4*67 

^'i 

11 

Mean First 

Sr n 

1] 

Mean First 


« f/i 

3*43 

4*57 


4 

Six 5*50 

f Sq (Z) [ 

10 

Six 5*00 


^Hz 

3*50 

4*68 

S., (Z) 

3' 


\SUP) 

.9; 




1*93 

5*33 

SSsW 

6 



6^ 


‘o 

2-01 

5*51 

\SUP) 

6" 



‘6 


s 

^4 

3*59 

6*11 

SUP) 

2 


(Ss{L) 

7' 


pL 


3*44 

6*07 

S 7 [ 

11 

Mean Last 

\Ss{P) 

’8 

Mean Last 

CZ2 

3*45 

6*12 


10 

Six 7*55 

Ss 

■4' 

Six 4*56* 



2*40 

6*74 

\Se{P) 

9 


SUB) 

'3' 


2*44 

6*87 J 

tSs{L) 

'7' 


Si 

1' 



Sr 

2*92 

8 23 

\S3(iC) 

[8 


SUP) 

2; 




4*38 

7*37 

(m, (Z) 

ri' 

Mean First 

(Ml, (Z) 

T 

Mean First 

M 

4*56 

7*64 

\Mlr{P) 

2 

Two i*50 

\Ml, (P) 

'2' 

Two 7*50 

eg 

tsi 


3*11 

6*30 

(MlUL) 

3' 

Mean Last 

(Mis (B) 

3 

Mean Last 


3*21 

6*42 

\MlsiP) 

[4 

Two 5*50 

\Mls(P) 

;4; 

Two 5*50 


Mx, 

3*03 

5*90 

(Mxi (P) 

ri 

Mean First 

( Mxs (Z) 


Mean First 


3*90 

5*88 

XMjciCL) 

2 ' 

Four 3‘dO 

\Mxs{P) 

6 

Four 0*50 


UxM 

2*87 

5*15 

f Mjps (Z) 

6 


(Mxs (Z) 

’8 


pS 

2*89 

6*17 

\AfjCs (P) 

’6' 


\Mxs {P) 

■7] 


3 

JfxsVl 

3*14 

7*69 

\Mx^ (Z) 


Mean Last 

(Mx, {P) 

■f 

Mean Last 

12^ 

3*26 

7-94 

\M.v^iP) 

4 

Four 5*50 

\Mx, (Z) 


Four j^*50 



2*59 

6*75 

(MX4 (Z) 

8 


(Mxs (Z) 

3 



2*68 

6*99 

\MxUP) 

[7 


\Mxs {P) 

4_ 




6*86 

5*97 

0i 

[2 

Mean First 

Os [ 

11 

Moan First 


0^ 

2*60 

7*29 

0 , 

’1 

Six 4'17 

Os 

12' 

Six 7*07 


Os 

2*33 

8*86 

lOrW 

> 

Os t 

10' 



Oa 

6*04 

4-72 

XOr {P) 

4 


0i 

2 


3 

0, 

3*69 

4*90 

0s{P) 

'8 


0s(P) 

‘8 



Os 

1*82 

8*33 

Os 

7 


{OUB) 

\0r(P) 

'3' 



{z 

6*34 

5*49 

i<h{P) 

\OUL) 

'6 

Mean Last 

'4] 

Mean Last 


6*54 

6*61 

:6; 

Six 8 'SS 

0s{B) 


Six 5*55 



3*69 

6*78 

OsW 

9 


Oa 

7 




3*29 

6*20 

Os [ 

10 


Oi 

>] 



o 

^»\z 

3*48 

4-60 

0 , r 

11 


{OUR) 

\0UB) 

6‘ 



3*30 

4*43 

O 0 [ 

12' 


:6] 



* Unfortnnatelj in Dr Woo*s paper {BiometHkat Tol. xxn. p. 880 , Table I, and Ml^ should be interchanged 
all the ooastante of the former are larger than those of the latter. 
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the moiety of the coefficients which are larger give a larger average of the square 
bracket numbers or correspond to characters of lesser size. In other words, the 
coefficient of variation of a character is not the same for all characters but roughly 
varies inversely as its size. 

These results appear to indicate that if variation in the rates of bone growth be 
different from individual to individual, these rates cannot be the same for all 
directions of growth in the same bone. In other words, a bone in its growth must 
change not only its size but its shape. If for an individual, the individual bone 
changed its size only, not its shape, then it seems reasonable to suppose that the 
coefficients of variation would be the same for all measurements on the same bone. 
Ifc seems then highly improbable that bones grow isotropically, or if growth from 
a centre of ossification be isotropic, the presence of more than one centre causes the 
growth of the bone as a whole to be hcterotropic. 

We can obtain some rough appreciation of the influence of absolute size on 
absolute variation by calculating the correlation coefficient p of order of size and 
order of variation for each bone and pooling these orflers. We find that p « *2687 
and, by the formula r = 2 sin (^Trp), deduce 

r = *2804. 

This average correlation for the seven bones is a rough measure of the influence 
of the size of a character on its absolute variability ; it is not so large as we might 
have anticipated. It would have been greater had we pooled all the place measure- 
ments together, but it seemed desirable to free ourselves of such great differences 
of size as occur, for example, between our sphenoidal and parietal measurements. 

If the same method of correlation be applied to the absolute sizes and the 
coefficients of variation we find p = — '1138 and r = — *1191. Thus the relative 
variation exhibits over 50 less correlation than the absolute variation does with 
absolute size, but it has overcorrected. 

To sum up: The correlations of absolute size with both absolute and relative 
variability as represented by the standard deviation and the coefficient of variation 
respectively are small but significant in the case of measurements taken on the 
individual bones of the human skull. The correlation of absolute size wdth the 
standard deviation is positive, and with the coefficient of variation is negative; the 
latter is considerably smaller than the former, but the distribution of the total 
variation over the mean length overcorrects. So far no one has suggested a constant 
which will give a better measure of relative variation than the coefficient of 
variation*. It may be that a better exists, but it is more probable that variation 
is so peculiar to each character that it is not possible by the examination of a single 

* It might be thought that a better ooeffioient could be found by taking the ratio of each deviation 
to its own length. Let c be the character, c its mean, cr^ its standard deviation, v==(rjc and l^=;100v 
its coefhoient of variation. Then instead of V'=100<r«,/c let us take 

^=100 a/s M 
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constant to assert whether A is more variable than Only when we compare the 
coefficients of variation of the same character in two sexes or two races can we get 
a partial answer to the question. 

It remains to be considered whether it is possible to discover whether there are 
any special directions of measurement on the bones of the skull, transverse, sagittal 
or vertical, which are associated with larger relative variabilities. The difficulty 
here is that some of the measurements, especially some of the arcs, have very mixed 
directions. 

The following classes were made: 

Sagittal, i.e. practically parallel to the Frankfurt horizontal and the median 
sagittal plane. 8t, sagittal transverse, in which were included t(8)y transverse, with 
a smaller amount of sagittal in it, and 8(t), sagittal, with a smaller amount of 
transverse in it; sv, sagittal vertical, with which were included «(v) and v(8), 
sagittal with a less amount of vertical and vertical with some small amount of 
sagittal; and finally sot, a measurement combining all three directions. Thus 
for sagittal measurements we have s, st, sv, stv. In a similar manner we have for 
transverse measurements ts, tv and tvs, and for vertical measurements v, vs, vt 
and vst The order of the letters in these classes is naturally indifferent. One or 
other of these classes was to the best of the writer’s judgment affixed to every one 
of the 63 characters; and then the position of the coefficient of variation in each 
bone class taken and placed under the direction class of its measurement. These 


where N is the total number of observatione. Then 


= S 1 Ac=c - c 

N (cfAc)® 


where the /3’s have the customary statistioaJ significance. For a symmetrical system VjSj and 
etc. =0, and for a normal distribution /9a=3, /84=15, and then 

P=100t)(l + 9»»-f 

if we neglect terms in But to the same degree of approximation 

1 +0d»+ 75«>*= (l - , 


or, finally, 


F=F- 


( 1 ?? — 

V 80,000 ) 


n 


Thus, though F may be theoretically more satisfactory than F, it is clear that F can only be constant 
as V becomes constant, and F is easier to compute than F. Indeed it is diilicult to think of any other 
lengths than c and for a normal frequency which could give a ratio possibly independent of absolute 
size. It may be noted that when F is small, F will be almost identical with F in value, and that when 
F is even as high as six, F will only differ from F by about 1 to 2 per cent. 
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positions were then averaged for eskch class. As the positions denote descending 
order of intensity of relative variation, a low average position (a great number for 
the mean position) marks leas variation. There resulted the following scheme: 


(i) 


(ii) 


(iii) 


Sagittal or in part sagittal measurements. 


Class 

8 

St 

sv 

stv 

All 

Mean Position 

7-333 

5*692 

5*471 

6*111 

5*909 

Transverse or in 

part transverse measurements. 



Class 

t 

ts 

tv 

tvs 

All 

Mean Position 

6*000 

6*692 

4*833 

6111 

5*688 

Vertical or in part vertical measurements. 



Class 

V 

vt 

vs 

vst 

All 

Mean Position 

3-876 

4*833 

5-471 

6*111 

6*179 


This tabulation gives a uniformity of results far more decisive than anything which 
might have been anticipated. Whether we judge by the pure directions, v, or 
the total of measurements in which one of these directions appears, the order is the 
same; sagittal measurements are on the average least variable, vertical measure- 
ments most variable, and the transverse intermediate 7*333 — 6*000 — 3*875, or 
5*909 — 5*688 — 5*179. And again when we have a bi-directional class st is less 
variable than and ts less variable than tv. Again in group (iii) v, the most 
variable direction, is more variable when combined with t than when combined with s, 
because t is more variable than s. 

These results seem undoubtedly suggestive; the individual bones of the skull 
tend to vary most in the vertical, less in the transverse and least in the sagittal 
direction. 

If we venture to attribute individual deviations from the type to individual rates 
of growth, then the want of constancy in the coefficient of variation may be due to 
the fact that the growth of a cranial bone is not the same in the three fundamental 
directions, sagittal, transverse and vortical. Thus we may conclude that the coefficient 
of variation is likely to be most useful when we are comparing the same measure- 
ment in two races or in the two sexes. 

If we attempt the same method on the absolute variations, even after “placing** 
by the separate bones and not by the skull as a whole, we do not get the same clear- 
cut result, the correlation between absolute size and absolute variation is too great. 
We find: 

(i) Transverse or in part transverse measurements. 


Class 

t 

ts 

tv 

tvs 

All 

Mean Position 

6-600 

6*000 

4*667 

3*833 

5*233 

Sagittal or in part sagittal measurements. 



Class 

8 

st 

SV 

stv 

All 

Mean Position 

6167 

6*000 

6*684 

3*833 

6*886 
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(iii) Vertical or in part vertical meamirements. 

Class ... V vt vs vst All 

Mean Position 4‘50() 4*667 6*684 3*833 5*487 

Thus, if judged by pure vertical lengths, the absolute variation is greater in 
the vertical than in the other two directions, but stands only second if we judge by 
all measurements which have a vertical element. Again the sagittal length if judged 
by the pure sagittal lengths is more variable than the transverse, but less variable 
if judged by all the measurements which have a sagittal element. In neither case 
do we get the same uniformity in the bi-directional groups as when we deal with 
the relative variabilities. Thus there appears to be a more systematic relationship 
among the relative than among the absolute variabilities. There is no full clue at 
present to the origin of the differences among the latter. 

Thus judged by uni -directional averages the order of increasing absolute 
variability is transverse, sagittal and vertical, but judged by all variabilities in 
which these directions occur it is sagittal, vertical and transverse. Nor are the bi- 
directional place averages arranged in any satisfactory system. Accordingly^ while 
the relative variabilities do fit a more or less consistent arrangement, the absolute 
variabilities beyond a moderate correlation with absolute size do not provide us 
with any clue at present to the source of the differences, and the apparent (if 
possibly not real) irregularities in their values. 

(4) On the Indices of Individual Cranial Bones, 

The large number of indices which could be formed, namely : 21 for the frontal, 
21 for the parietal, 91 for the temporal, 55 for the sphenoid, 6 for the malar, 28 for 
the maxillary and 66 for the occipital bones, or a total of 288, rendered some 
selection needful. First, some 29 indices were chosen and the correlation tables 
formed. Among these were twelve pairs of indices for homologous characters and 
five for non-bilatoral indices. These indices, their means and standard deviations, 
are given in Table III, p. 434, while in the last column of the table are placed the 
correlations of the corresponding indices of homologous bones. 

We may first inquire whether the size of the index has any influence on its 
variation. In Table IV, p. 435, these indices are arranged in order of size, and 
we shall discuss their variabilities from this standpoint. 

If we take the first 15, or greater indices, we find an average standard 
deviation of 5*96, and the last 15, or lesser indices, one of 5*58, on the basis of 
which we can make no definite statement. This view is confirmed if we remove 
the five non-bilateral indices which give a mean of 7*26, and average the first 
12 homologous indices giving 5*68 and the last 12 giving 5*30. Again, if we 
compare the 12 pairs of homologous indices, the variations in the pairs are very 
close ; but in six cases the larger and in six cases the less are associated with the 
greater variation. In view of these facts it is impossible to suppose that size 
of index is at all associated with intensity of variation of index. The coefficient of 
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I’ABLE III. Constants of the Individual Bone Indices. 


Bone 

Index and Number 

Mean 

Standard 

Deviation 

Correlation of 
corresponding 
indices of 
homologous bones 

Frontal 

\OOF 3 IFi f88(J) 
\OOFf,IFi (886) 

XOOF^fFj (883) 

82*4187 ±-l 216 
109-7822 ± *1691 
89*4779 ±*0903 
89*3885 ± *0943 

5-3639+ *0859 
7-0207 ±-1125 

3- 9797 ±*0639 

4- 1531 ±-0667 

]-+-B171±-0076 

Parietal 

lOOPii/A (743) 

100/’;, /A (737) 

88*1 682 ±*1472 
87 -0283 ±-1389 
93*3989 ±*0760 
92*9688 ± *0760 

5*9507 + *1041 
5-6125 + *0982 
3-0181 ± -0630 
3*0595 ± -05.38 

1 + *7892 ±*0093 

1 + -4057 ± *0207 

Toiuporal 

100 T’a/r,; (843) 

ioo7’„/n(82r.) 

70-5(535 + *1440 

7 1*3737 ±*1442 
99-3649 ±*1913 
1(KX*1915±-1909 

6-2264+ -1023 
0-2058+ -1020 
8*1484+ 1353 
8*1305 + *1350 

1 + *7934 ± *0086 

1 + *6186 ±-0145 

Sphenoid 

lOO/Ss/^S'i (723) 

100,S„/,S7 (812) 

80*7510±*1603 
80*3195 ±*1621 
100*6478±1834 
100-.5961 ±*1858 

e*3895±1134 
0*4642 ±-1147 
7-7478 + -1297 
7*8508±*1314 

1 + *8822 ± *0066 

1 + -9048 ± •C)043 

Malar 

lOOjfia/Afi, (716) 

83*1522±*1475 
83*9902± *1518 

5-8490 ±-1042 
0-0232± -1073 

j- + -8806 ±-0067 

Maxillary 

IOC) (508) 

100 3/.r3/J/^i (414) 

83-6063±16]4 
84*2008 ± *1622 
61*1908±*1101 
61 -3768 ±*1133 

5-3937±-n41 
5-4193±1147 
3*3215 ±*0779 
3-4180 ±*0801 

1 + *9344 ± *rX)38 

1 + -7894 ±*0125 

1 

Occipital 

100 Go/f>s («h7) 

100 6 ) 4 / (850) 
10005 /( 0 ^ 4 - 03 ) (847) 

IOOOh/Ot (855) 

100 0,/0o(849) 

i 83-4959 ±*2209 
110*8271 + -1623 
117 *8642 ±*1674 
65*6947 ±*1060 
64*4655 ±*1000 
85*6(X56±*1242 
86*0942 + *1187 

9-6883+ -1662 
7*0150±*1148 
7-2216±*1184 
4*5929 ± *0749 

4- 3342 +*0707 

5- .3660± -0878 
5*1271 ±*0839 

1 + *6044 ±*01 46 
| + *4769± *0179 


variation, if indeed it had meaning in the case of indices, would thus not be steadier 
than the absolute variation as measured by the standard deviation. The cause or 
causes of such diversity of variation ranging from 3*0 to 9*6 units must be sought 
elsewhere than in size of index*. 

* Dr G. M. Morant suggests that the order of laying down the bones and the period during which 
they grow — depending to some extent on their size — may provide a partial explanation of the diversity 
in variability and in the intensity of the effect of homology in the case of these indices. He points out 
that AfjTji, which is an indirect measure of the dental arcade, is probably determined at an early age, 
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Let us look into two further questions which can be answered from Tabl^ III. 

The mean of the indices in Table III on the right side of the skull is 83*47, 
and that of the indices on the left side is 83*42, whence it is clear that there is no 
lateral dominance of the indices as a whole. Pursuing the same simple method, 
we find 

Mean standard deviation on the right side == 6*499, 
and Mean standard deviation on the left side = 5*483. 

There is thus clearly no lateral dominance in variation as a whole. Thus by the 
use of indices we have largely freed ourselves from lateral dominance, either in 
size of index or magnitude of its variation. 


TABLE IV. Twenty-nine Indices of Cranial Bones in order of Size, 


Order 

Index 

Standard 

Deviation 

Order 

Index 

Standard 

Deviation 

J 

loorvco.+o,) 

7*22 

16 

(L) 

6*42 

2 

3 

100 O 4 /O 1 

100 /'V>4 

7-02 

7 02 

17 

18 

lOOMx^/ifxi (R) 

100 Mltl Ml (Z) 

5-39 

6*02 

4 

iooaV'St (*) 

7-75 

19 

lOOOyOj 

9*59 

5 

ioo.v*7 (L) 

7*86 

20 


.^•85 

0 

\Q 0 TJT^{L) 

813 

21 

100 A/a 

5*36 

7 

100 TJTUR) 

8-16 

22 

looA/AW 

6*39 

8 

100 /> 3 /P 4 (/f) 

3-02 

23 ! 

100 a/a (Z) 

6*46 

0 

100 A/ AW 

306 

24 

100 A/r, (Z) 

6*21 

10 

100 a/a (iK) 

3-98 

2r» 

1007’,/A (A 

6*23 

11 

iooAyA(/') 

4-15 

26 

iooa/A(«) 

4*59 

12 

looA'A(^) 

5-95 

27 

loooyOiCA 

4*33 

13 

iooa/AW 

5*61 

28 

100 MxalMx, (/i) 

3 *.32 

14 

100 6 » 8 / 0 „(A 

5-37 

29 

lOOMxs/JfXt (Z) 

3*42 

If) 

lOOOs/ACA 

5-13 




We will now arrange our homologous indices in the order of intensity of their 
correlation coefficients, see Table V, p. 436. 

These give an average correlation of homologous indices of *7414, marking 
a high relationship due to homology after we have removed the lateral dominance. 
The sources of the differences in the intensity of homologous influence on the 
various bone indices is not easy to discover*. It is true that all the indices do not 
belong to the same class; in the second, third and eighth indices in Table V, 

and that and P 4 are the greatest lengths measured. The index of one stands at the top of the 
homology order and variations of the maxillary indices are on the average small. The index P 3 /P 4 
stands at the bottom of the homology order. It may, however, be noted that its variation is the least 
in Table IV. The sphenoid is one of the earliest bones to be laid down, and its indices stand second and 
third in the homology order. There will obviously be discordances in the interpretation of Tables IV 
and V on this basis, but investigations might prove profitable when the authors have devoted more 
time to a study of what is known regarding the development of the skull. 

* Bee however the preceding footnote. 

28—2 
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84 , = distance from krotaphion to krotaphion, and 8 ^ = distance between the lowest 
points right and left on the suture between the medial-pterygoid plate and the 
palate bone, are transverse lengths crossing the median sagittal plane; and again 
Pi is the common arc between the two parietal bones. Hence these three indices 
differ from the remaining nine as not being composed of homologous lengths on 
opposite sides of the skull, but have the denominators in the pair of correlated 
indices the same. But this cannot be said to throw much light on their relative 
positions in the above table. In choosing the ratios of bone lengths some attempts 
were made to take them so that we should obtain a rough ratio of length to 
breadth, this accounts for indices like T%IT^, Pji/Pi, 

T^IT^y ObIOi and OqIOb. But these eight indices show great differences in their 
correlations, and are in no way classed in position together. P 2 , Pi, Pe* I&t Ob, O 7 , 
Oe end chiefly at cranial points, but we are far from certain that that gives them 
a reduced homologous correlation, and as a matter of fact the lengths of the first 
six indices are in very much the same condition as to their terminals. 

TABLE V. Order of Correlation Goejficients for Homologous Indices. 


Order 

Index 

Correlation 

Coefficient 

Order 

Index 

Correlation 

Coeffioient 

1 


+ *9344 + 0038 

7 

iOOMxsjMwy 

+ •7894 + ‘01 25 

2 

lOO^V-SV 

+ •9048 + -0043 

8 

ioo/v>’i 

+ •7892 ±-0093 

3 


+ *8822 + *0056 

9 


+ •6186 + -0145 

4 

\QOMl 2 IMl 1 

+ -8806+ -0057 

10 

lOOOg/O, 

+ -6044 + -0146 

5 

100 /^VFi 

+ •8171 ±‘0075 

11 

1000h/«„ 

+ ‘4769+ 0179 

« 

100Tb/7"s 

+ •7934 ±‘0086 

12 

100/yA 

+ ‘4057 + -0207 


All, we think, that at present we are justified in concluding is, that homology 
plays a large part in determining the shape as well as the size of homologous 
bones, but that while this influence has very significant differences as between one 
bone and a second, we cannot so far state why homology should be much more 
active in the maxillary or sphenoid bones than in the parietal or occipital. Far 
more correlations of homologous indices must be worked out before light can be 
thrown on this point. 

We have seen by the mere process of averaging that there is no dominance 
of either side of the skull in variation as a whole. But this does not exclude 
dominance in the cose of individual bones. Table VI indicates that such dominance 
most probably exists in the case of two indices, although the ratio to its 

probable error is not “markedly” significant. 

In the case of both bones, where dominance of indicial variability is probably 
significant, that dominance is on the right side. But before we consider this point, 
it is worth while to inquire whether there is significance in the series of twelve 
ratios as a whole \ for this purpose we apply the P^n test. Multiplying the last 
column by *67449, we shall have the ratio of the deviation to the standard deviation 
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TABLE VI. Difference of Absolute Variability for Homologous Indices, 


Category of 


Difference and its 

Ratio of Difference 

Significance 

Index 

Probable Error 

to its Probable Error 

of Difference 


A«_z,=*=p.e. of Ai? L 

A/f_/,/p.e. of 

Probably 

1CX)A/A 

+ •3382 ±-0880 

+ 3-84 

significant 

1000«/07 

+ •2587 ±-0821 

+ 3*16 

Possibly, but 


- -1741 ± *0709 

-2-46 

far from definitely, 

]00 0k/«» 

+ •2389 ±-1068 

+ 2-24 

significant 

XQOF^IFi 

-•1734 ±-0865 

-2-03 


\00Mx^lMxx 

-•0966 ±*0686 

-1-41 


1 00 (Sfl/ 

-•1030 ±*0786 

-1-31 

Most |.)robably 
insignificant 

loo/VA 

-•0747 ±-0769 
-•0414 ±-0690 

- 0*98 
-0-60 

lOOvt/a.ji/J/'x, 

- -0256 + *0676 

-0-44 


100 n/n 

100 TJT:. 

+ •0197 + -0879 

+ 0-22 


+ •0178 + -1601 

+ 0-12 


of that deviation, and the probability integral tabic of the normal curve* provides 
us at once with the probability integrals of the twelve values. In Table VII the 
corresponding logarithms and their sums are provided. 

TABLE VII. 

Criterion for the General Significance of the Twelve Ratios in Table VI. 


Order 

Deviation 


logjp 

y = l -p 

log p 

H.D. of Deviation 

P 

1 

+ 2-690 

•99520 

J -997,8231 

•00480 

3-681,2412 

2 

+ 2-125 

•98321 

f-992,6463 

•01679 

2-225,0507 

3 

-1*659 

•04856 

2*686,2787 

•96144 

1-978,3814 

4 

+ 1-511 

*93460 

1 -970,6258 

•06640 

2-815,5777 

5 

- 1 -369 

•08550 

2*931,9661 

•91450 

1*961,1837 

6 

-0*951 

•17080 

1-232,4879 

•82920 

1-918,6593 

7 

- 0-884 

*18835 

1-274,9656 

•81166 

1 -909,3688 

8 

-0*661 

*26431 

1*405, .3634 

•74569 

1-872,6683 

9 

-0-406 

•34274 

1*534,9048 

•05726 

1*817,7372 

10 

-0-297 

•38323 

1-583,4596 

•61677 

1-790,1232 

11 

+ 0*148 

•55883 

i -747,2797 

*44117 

1-644,6060 

12 

+ 0-081 

•53228 

l^gio X„ = 

1-726,1401 
- 5-915, 9989 

•46772 

b>gi() X'„ = 

1-669,9859 
: -6*716,5266 


^Jn logio C-* Vl2 X -434,2946 «= 1 -604,4403. 

w =* - logio logio c) = 3*9324, w' = - logio X'n/( V n logio e) — 4*4638. 

* The true curve of distribution of the standard deviations of samples from a normal curve is given 
by Helmert^s equation — a Typo III curve, but in the case of a large sample this passes into a normal 
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Hence Qxn = 1 ~ ^ (H. 3-9324). Qv« » 1 - / (11, 4-4638). 
where I (jj, u) is the gamma integral ratio as tabled in the Incomplete T~Function 
Tables, Since smaller than Qa„, it suffices to consider its value. 

Thus we have Qv„ = *1562, 

and this does not indicate any great improbability of the twelve differences being 
due to random sampling from a population of no difference between -R and L. 

It is possible therefore when treating the twelve differences as a whole to consider 
them as random samples from populations having equal variation, notwithstanding 
that treated individually the first two differences might be considered as probably 
significant. This p)oint — i.e. of the difference between a single A/(p.e. of A) 
and a whole series of such values — has too often been neglected in craniometric 
memoirs. 

(6) Comparison of the Influence of Homology on Index Correlations and 
Absolute Lengths, 

Returning to Table V, we recall the fact that the average correlation of the 
twelve homologous indices is *7414. For the twenty-five absolute homologous 
lengths, measured on the separate cranial bones, it is *8198. Thus on the average 
the absolute homologous lengths were more closely related than the indices, i.e. 
the sizes of the bones were more closely related than their shapes. 

We may analyse this result a little further in the following table : 


Correlation of Homologous Characters, 



Mean Index 

Mean Absolute 

Bone 

Correlation 

Lengths Correlation 

Sphenoid 

•8935* 


•8206] 


Malar 

•8806 


•9309 


Maxillary 

•8619 


•9198 


Frontal 

•8171. 


•9221 


Temporal 

•7059] 


*7894] 


Parietal 

•5974 


•7253 

r 

Occipital 

•5407 J 


•7087 J 


All cases 

•7414 

•8198 


It will be noticed that in both cases, index and absolute lengths, the temporal, 
parietal and occipital bones are the least highly correlated. The facial bones, 

curve, with mean at the modal value of the S.D. But the difference of two quantities each following a 
normal distribution, if divided by the standard deviation of that difference, follows Student’s z curve, 
which again passes into a normal curve for large samples. Since each pair of homologous bones has the 
same number of individuals, we may take a series of probability integrals, each corresponding to a normal 
curve based upon different-sized samples for each pair of different homologous indices. 
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malar, maxillary and frontal, form also, in both cases, a ^oup with hij^h cor- 
relations. The sphenoid stands at the top in the index column, and intermediate 
between the two groups in the absolute lengths column. With this one exception 
the influence of homology is more intense in the case of every bone in regard to 
size than to shape ; also, apart from the sphenoid, the order of intensity of the 
influence of homology is much the same for indices and lengths. The greater 
symmetry of the face over the brain box is a noteworthy fact. Possibly it has 
been due to sexual selection ; we notice failure of homology in the face far more 
than in the case of the retroceding calvaria. 

(6) Method of studying the Index Correlations of Individual Cranial Bones. 

In order to study these index correlations, we shall require to classify our 
cranial bones. Such bones may be either on the same or opposite sides of the 
median sagittal plane. If on the same side, they may be either contiguous or 
non-contiguous. If on opposite sides, they may be homologous or non-homologous. 
If non-homologous, they may be contiguous or non-contiguous with their homo- 
logous bone. The consideration of contiguity is a priori of great importance 
because it is not unreasonable to consider that contiguity would have considerable 
influence on measurements of shape*. 

In Table VIII we have classified the cranial bones with regard to their “ con- 
tiguity.” On the basis of these measures of contiguity, and taking into account 
that the components of some of the indices do not possess “ laterality,” i.e. are 
measured across the median sagittal plane or along it, we can divide up our 
correlations for purposes of study into the following classes : 

(i) Correlation of Indices of Contiguous Bones (same side). 

(ii) Correlation of Indices of slightly Contiguous Bones (same side). 

(iii) Correlation of Indices of Non-contiguous Bones (same side). 

(iv) Cross-Correlations. Index of a Bone with the Index of a Bone contiguous 
with the homologue of the first (on the opposite side). 

(v) Cross-Correlations. Index of a Bone with the Index of a Bone non- 
contiguous with the homologue of the first (on the opposite side). 

(vi) Correlation of two Indices of the same Bone. 

(vii) Correlation of the same Indices on Homologous Bones f. 

(viii) Correlation of different Indices oh Homologous Bones. 

* Where the union of two bones is an “overlap” (o.g. temporal and parietal bones) and not a 
serrated suture, the measurements have been taken to the edge of the overlap (e.g. squamous suture), 
the skulls not having been separated into their individual bones. This may have some influence on the 
correlation of the indices of contiguous bones, but it has not been possible to trace it. Nor have we been 
able to find in the index correlations differences associated with ossification in cartilage and development 
in membrane. 

t This has already been discussed: see pp. 436 — 438 above. 
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TABLE VIII. Continuity of Cranial Bones. 


Bone 

Frontal 

Parietal 

Temporal 

Sphenoid 

Oooipital 

Maxillary 

Malar 

Frontal 

1 


Contiguous 

Non- 

contiguous 

Contiguous 

Non- 

contiguous 

Slightly (a) 
contiguous 

Contiguous 

Parietal 

1 

(Contiguous 


Contiguous 

Slightly (b) 
contiguous 

Contiguous 

Non- 

contiguous 

Non- 

contiguous 

Temporal 

1 

Non- 

contiguous 

(CJontiguouK 

- 

Contiguous 

Contiguous 

Non- 

contiguous 

Slightly (c) 
contiguous 

Sphenoid 


Contiguous 

Slightly (6) 
contiguous 

Contiguous 

— 

Contiguous 

Contiguous 

Contiguous 

Occipital 


Non- 

[ contiguous 

Contiguous 

Contiguous 

Contiguous 

— 

Non- 

contiguous 

Non- 

contiguous 

Maxillary 


f Slightly fa) 

1 contiguous 

Non- 

contiguous 

Non- 

contiguous 

Contiguous 

Non- 

contiguous 

— 

Contiguous 

Malar 

j 

1 

Contiguous 

Non- 

contiguous 

Slightly (c) 
contiguous 

Contiguous 

Non- 

contiguous 

Contiguous 

— 


(a) Superior nasiU point to Daeryon, 

{h) From Splienion to Krotaphion. 

(o) On the temporal proceas. 

There are many degrocH of “Contiguity,” but the above notes will beat explain what we mean by “Slight Contiguity.” 

(ix) Correlation of Indices on Contiguous Bones, one Index of the pair having 
no laterality. 

(x) Correlation of Indices on Non-contiguous Bones, one or both Indices of 
the pair having no laterality. 

Under these classes we shall study our index correlations. 

(7) Correlation of Indices of N cm ^contiguous Bones (on same sflde). 

We will start with Group (iii) as most convenient for our present purposes. 

Table IX indicates that we have 40 indfex correlations in our series of 29 
indices — of which 27 have been worked out — when we confine our attention to 
non-contiguous bones of the same side. In this series those of highest value, as a 
rule positive, are between indices of the occipital and parietal bones and the 
maxillary index Mx 2 lMa;i. There seems little obvious reason why a breadth-height 
index of the maxillary should be less closely associated with a height-length index 
of the parietal, and in a positive sense, than with such indices as Os/O? and O^IO^ 
of the distant parts of the occipital. Sometimes, as in the cases of Fi/Fi with 
la/ 2^8 and Og/Og with M^/Mli, the right and left sides give correlations of opposite 
sign. Again, the correlations on the two sides, while differing within the limits 
of random sampling, differ fairly considerably from each other. Without being able 
to give any explanation of why certain correlations were higher than others, we 
note that all the correlations are less than 0*1, and thus perfectly useless for the 
purpose of predicting the shape of one bone from that of a second non-contiguous 
bone on the same side. 
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TABLE IX. Correlation of Indices of N on-contiguous Bones {on same side). 


Index Pair 

Correlation 
and p.e. 

Index Pair 

Correlation 
and p.e. 

O^IO^d), Mjs.il MML) 
P^IP,(a), MjSi/Mx-iiH) 
P^IPt(L), Mv^lMx-AL) 
O^IOt{R), Mx.ilMxi{R) 
OJOr(,L), MXilMxi(L) 
MxalMxi(R} 
TJT^IL), Mx^lMXi{L) 
F.JFi(R), OJO,{R) 
F^FiiL), 0,IOr(L) 
P,lPi(R), MhlMh{R) 
P.JP,(.L), Ml.JMh(,L) 
T‘ilT»{R), MxtlMxi^R) 
T-ijT^lL), Mx^lMxi(L) 
PilPi(R), MxilMxj{R) 
PtlPi(L), MXijMxiiL) 
PslPi(R), MxJMxiiR) 
P^IPiiD, Mx^lMxi(L) 
Ta/T,(R), Mx.il Mx,{R) 
T,IT,{L), Mx.il Mx,{L) 

+ •0674 +*0284 
+ •0819 ±-0283 
+ •0564 ±-0297 
+ •07821 •0248 
+ •07441-0284 

+ -0711 1 -0307 

+ -0677 1 ^0230 

- -0406+ -0268 
- •0652 + •0268 
+ •0376 + -0282 
+ •06321 •0282 
-•06361 •0296 

+ -05321 •0322 

+ •0529+ -0286 

F^IFiiR), TilT^(R) 
F^IF^iL), TilT,(L) 
OJO,(R), MIJM^IR) 
OtlO„{L), MklMli(L) 
T„IT,(R), Mx^lMxi(R) 
Ty^Ci), Mx.JMxi(L) 
FilFi^R), T^R) 
FilFiiD, mr,{L) 
OilOt(R), Mx.ilMxi{R) 
0^0,l^L\ Mx^lMxdD 
P.JPi{.R), MhlMh(R) 

PilPiW, MlilMliiD 
PtlPi(R), MxalMxi{R) 
^P,{L), Mx^lMxdL) 
F^IFiiR), 0^!O„(R) 
F^IFy{L), 0^10, (L) 
0,I0AR), MxJMxiiR) 
O^IO-,(L), Mx^lMxxiV) 
<hlO,{R), MlilMliiR) 
O^IOt(L), Ml.il MliiL) 

4- *0387 +*0230 
- *0071 ± -0230 
+ •0131 ±'0249 
-•0361 ±*0247 
- -0308+ -0311 

+ •0245 ±‘0231 

-•0191 ±*0310 

-•0133 ±*0268 

+ •0092 ±*0323 

+ •0044 + *0232 
+ •0073 ±-0231 
-•0046 ±*0310 

+ •0019 ±-0249 

, 


As there were only 9 negative as against 18 positive correlations, we doubted 
whether the whole series could be treated as a random sample from a set of 
populations with zero correlation in each. Yet it seemed worth while examining 
this. We applied the F^n treating samples (in all cases over 430) as having 
their r’s distributed normally. We found that the more stringent set of the 
probability integrals gave 

Qa'„= 0066, 

or, only in about 7 in 1000 trials would so poor a result arise in such large samples 
from populations having zero correlation between the indices*. Thus we feel 
bound to discard the hypothesis of these indices even in the case of non-con- 
tiguous bones being uncorrelatcd. Yet in the bulk of cases the indices are so small 
and irregular that we do not feel able to give much scope to a ** general control 
factor” regulating the shapes of non-adjacent as well as adjacent bones. 

If we leave out of account the correlations of Og/Og, Og/O? and FsIPt, with 
then we find 

Qvn--n90, 

or, it would not be an unreasonable assumption to suppose the remaining 22 index 
correlations to be really zero. This is a somewhat arbitrary procedure, yet it 
suflBces to indicate how slender the correlations are. Indeed, if we applied a test 

* Besides the hypothesiB made in the text of zero r for very large samples being distributed normally, 

the r must be that of material which more or less closely follows a normal surface. If we adopt the 
assumption, not unreasonable, that the absolute measurements of the cranial bones do so, the indices 
will fail to do so with theoretical accuracy, but all the same with adequate 'practical suilioiency. 
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of thrice the probable error to each result individually, none of them would pass 
as certainly significant. Such a method of testing would not, however, be con- 
vincing when we have to deal, not with a single value, but with a large series 
of correlation coefficients. 

Accordingly, to get a measure of the “general factor” controlling development 
of shape of the individual bones, we may find the mean correlation of the 27 non- 
contiguous bones == + *0197, and ask whether samples from 27 populations having 
this correlation, but their own standard deviations, could reasonably have given 
the 27 observed values ; we find 

Qv««*3452. 

Accordingly, we may conclude that the general factor controlling the shapes 
of non-contiguous bones may amount to about -f *02, while the factor of homology 
gives an average correlation of ’74, or has thirty-seven times the intensity of the 
former. 

The startling conclusion here is the smallness of the relationship between the 
shapes of the skull at non-contiguous parts. 

(8) Correlation of Indices of slightly Contiguous Bones {on same side). Group (ii). 

The coefficients are given in Table X. 


TABLE X. Cory'elation of Indices of slightly Contiguous Bones {on same side). 


Index Pair 

Correlation 
and p.e. 

Index Pair 

Correlation 
and p.e. 

P 2 IP 1 W, S,,l8t(P) 

n/TUP), 

7\IT„(L), MhIMlAL) 
P^IPiilt), SoI 8,(_A) 
P.JPi(L), S^ISiiL) 

+ *1205 ± *0247 

~ •0(374 ± *0250 

+ •0590 ±-0247 

+ -0482+ *0251 

F:IFi(Jt), Mx.ilMxt{/{) 
FJFtiP), Mx^lMxi{h) 
P;IP^{R), 

P^IP.W, SclS.,{L) 

MhlMhiP) 
MINIMI, {L) 

-•0409 ±-0306 

+ *0407 ± -0281 
+ -0093 + *0283 
- -0359 ± -0257 

+ -0113± -0246 
-•0004 ±-0245 


The only value which seems exceptional here is the correlation of the parietal 
P 9 /P 4 with sphenoidal especially when we note the negative correlation 

of P 2 /P 1 with the same sphenoidal index. Assuming that there was no arithmetical 
error in these values, the system of ten correlations was tested as a random sample 
from a zero population. Assuming normal distributions for the indices, their 
correlation coefficients r in the sampling would follow the curve = 2^0 (1 — 

As no n is less than 434, and n can take values up to between 800 and 900, it is 
sufficient to use a normal distribution in each case for r. Calculating the probability 
integrals and applying the P^n test, we found = ‘0086. The ten values can 
scarcely therefore be looked upon as random samples from parent populations with 
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zero correlation. Omitting the first parietal-sphenoid correlation, we found for the 
nine remaining correlations, n * ‘3266, or the slightly contiguous group may in 
the great bulk be treated as without correlation. 

The mean of the correlations of the indices for these slightly contiguous bones 
is -4- ‘0145. We next inquired whether these correlations could be considered as a 
random sample from ten populations having this value for their index correlations, 
but standard errors varying with the sizes of the samples. These sizes being large, 
we again supposed normal distribution of the coefficients and applied the pA^test: 
we found QAn = *2312 and Qa'w = ‘1422 ; the latter is the more stringent, but does 
not indicate that a random distribution of correlations from populations with a 
correlation of *0145 is very improbable. 

It would seem, therefore, that the factor for slight contiguity,” giving a 
standard correlation of *0145, is not definitely greater than that for non-contiguity 
with a correlation of *0197. There appears, accordingly, no reason for separating 
the slightly contiguous from the non-contiguous bones, and we may say that the 
general factor linking the shapes of bones which are not to a considerable extent 
contiguous does not produce a general correlation exceeding *02. 

(9) Correlation of Indices of Contiguous Bones {on same side). Group (i). 

We now come to the study of our first group. From this group we shall be 
able to appreciate the maximum influence of contiguity on measurements of 
cranial shape. We thus reach Table XI, containing the index correlations of bones 
on the same side having fairly appreciable parts in contact. It will be seen at once 
that the condition of things is changed, 13 correlations out of 44 have numerically 
a value in excess of 0*1 and they range nearly up to *16. Their average is — *01638, 
or, say, —*0164*. It is difficult to lay much stress on negative as against positive 
index coiTelations, for the formation of the indices, after the choice of lengths, must 
be more or less arbitrary, and, as a rule, but not absolutely, the choice was directed 
to obtaining an index under 100. However, for the indices selected, the indices 
of contiguous bones give on the whole a negative correlation, or the effect of in- 
creasing contiguity is to reduce correlation (*0197, ’0145, — *0164). If we consider 
the indices regardless of sign, then we find : 

Non-contiguous bones ... ... ... *0397 

Slightly contiguous bones ... ... ... *0434 

Contiguous bones ... ... ... ... *0655 

These results confirm the view that the slightly contiguous may be classed 
with the non-contiguous, but indicate that contiguity gives an association some- 
what less than double non -contiguity, if we neglect the sign of the correlations. 

We will now test whether it is possible to consider our index correlations of 
contiguous bones as a random sample from 44 populations with the correlation 

* The mean of the negative indf3^ correlations is - *0783, of the positive + *0314®, and of the whole 
system without regard to sign •06549. 
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TABLE XI. Correlation of Cranial Indices on Contiguous Bones {on same side). 


Index Pair 

Correlation 
and p.e. 

Index Pair 

Correlation 
and p.e. 


0^/0r(R) 

- *1578 ±-0238 

Sxx/Si (Z), 

MxJMxx (Z) 



PsIPa 

OJOr (L) 

— 

P 2 /P 1 (R). 

Os/oaR) 

- *0540 ±-0244 

MxJMjoi(R). 

MU ML (R) 

- *1217 ±-0290 

P 2 IP, {L\ 

OhIOo(.L) 

— 

\ MxJMx\(L\ MlJMl. (L) 

- *1472 + *0287 

Sxx/R, (/O, 

0»IO„{R) 

+ -0437 ± *0241 



+ *1167+ *0246 

(Z), 

OJO„{R) 

— 


+ •131 8 ±*0239 

SrJR,(R), 

Ml^/Mli (R) 
Mi.,/Mli (Z) 

+ *0436 + *0261 

P2lPl 

(hlO,{R) 

-•1313+ *0240 


+ •0011 ±*0257 

PJPl (/>), 

08/0,(Z) 

— 

Tjr.W, 

Oh/07 (R) 

+ *0420 ± *0234 

p^IPaW, 

OJOaiH) 

- *1237 ±*0240 

ToiniR), 

OJOAR) 

— 


OJOo (L) 

- *12.58+ *0243 


Mx-a\ Mxx ( R) 

+ *0396 ± *0325 

Mh/Mly (R) 

^*1296 ±*0313 

SrjRi (A), 

Mx’JMxi (Z) 

— 

MxJMXy (//), 

MlJMl, (Z) 

nr-AL) 

— 

F^Fi(,R), 

MU Mix (R) 

+ *0368 + *0245 

PJPaW. 

+ *1276+ *0240 

PJPi (R\ 
T2/TARh 
(Z), 

MLJML (Z) 
O 8 /O 7 (k) 

+ -0146+ *0243 

Pia/Pa 

+ •1224+ *0240 

+ •03.50 ±*0234 

Pi/ Pi 

T.JT3{R) 

- *1077 ±*0241 

OH/O 7 (Z) 

— 

Pi/Px 


— 

pjPa{R\ 

TJTAR) 

- *0346 ± *0245 


SiJRa {R) 

- *1001) + -0242 

Ps/Pa {R'U 

n/TAR) 

— 


RJR 4 (/-) 

- *0981 + *0239 

RfJRAiR), 

0„/<>, {R) 

-*0246+ -0248 

FJF, (H), 
fyPi (/>), 

P^/PiiR) 

- 0992 ±*0239 

(Z), 

Og/Oo (Z) 

+ *0076 ± -0245 


— 

n/T,(R), 

Sxx/SAR) 

+ •0205 ±*0241 

TJPrAP), 

OnlOo {R) 

+ *0883 ±*0234 

To/T.U\ 

^61^7 (R) 

— 

Tjn (X), 

On/OAR) 

— 

TJTUR\ 

^\/Sa(R) 

- *01 60 ±*0248 


MxilJtxiiR) 

- *0520 + *0301 

Tjn(L), 

RcJRa (R) 


P 2 IP 1 (/?), 

Mx^ilMxx {jL) 

- 0692 + *0297 

T^ITsiR). 

sjsaR) 

/^//saR) 

n/TAR) 

- *0117+ *0240 

P’JPaW 

P /Pa W 

Mx^/Mxx {R) 

+ *0689 +*0241 

T 2 /T 3 (R), 

— 

P 2 IP 1 (/>), 

+ *0032 + *0243 

P 2 /P 1 (/O, 

+ •0070 ±*0246 

*SV>S^7 {P)y 

- *0657 ± -0309 

P 2 /P 1 {Lh 

nm (Z) 

— 

-%/^\ (Vv), 

Mx^/Mxi {L) 

— 

/sySAR), 

M Li! Mix (R) 

- *0057 ± -0250 

{R) 

+ *0654 + *0234 

RJR 7 {L\ 

MlJMlx (Z) 

— 

Tjn 

O^/Oxx (Z) 

+ •0604 ±*0233 

F2!P^ {R\ 


+ •0044 ±-02.38 

S,/S\ (R), 
*^ 6/^4 (/'). 

Oh/0,{R) 

- *0620 ±*0247 

Fz/PiiR). 

Re/S7(R) 

— 

<hlO^{L) 

— 

s^/RAR). 

0,10, (R) 

-*001 2 ±-0241 


MxJMxi (R) 

-* •0e>17±*0287 


OJO, (Z) 

— 


~ ■0164. Working this out by the P^n tost, we find Q^n = *000,0012, or it is not 
possible to consider that we are dealing with random deviations from an average 
relationship measured by — 0164. This is really clear from Table XI itself, for it 
would appear that the correlation between certain bones is much higher, signifi- 
cantly higher, than that between others. 

To see whether it is feasible to throw any light on the matter we have drawn 
up Table XII, which gives the average correlation between the indices of bones 
on the same side, whether adjacent or not. First values are those regardless of 
sign, the second values in square brackets pay attention to sign. 

Confining our attention here to the mean correlation values independent of sign 
we see that the shape of the parietal has the closest average relationship to the 
shapes of other cranial bones. The highest average correlation with an individual 
bone is that with the occipital, where all the correlations are negative; this might 
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Karl Pearson and T. L. Woo 


446 



Based on Tables IX, X and XL 



446 Morphometric Characters of Individual Bones of the Shull 

be attributed to the length of contour in contact, and such a view might be con- 
sidered as confirmed by the second highest parietal correlation being with the 
temporal, but it is hardly in keeping with its relationship to the only slightly 
contiguous sphenoid or with the far more contiguous frontal. Indeed, while extent 
of contiguity plays a part in these correlations, it is clearly not the only factor to be 
considered. Thus the highest individual average correlation is that between malar 
and maxillary, where all the separate correlations are negative, yet the mere extent 
of contact between the latter pair is hardly in excess of that between parietal and 
occipital, or between parietal and frontal. Clearly the correlation will depend not 
only on the extent of contours in contact, but also on the extent relative to the size 
of the bones in contact. Indeed there are many other factors which may take part 
in determining these correlations — ^not only the original nature of the bones 
themselves, but the order in which they are laid down, etc. What is the really 
important point to be regarded is this: that, while many of the correlations are 
insignificant, some are certainly significant, but they are in the latter case so small 
that any idea of predicting the shape of one bone from the shape of an adjacent 
bone is idle*. As illustration let us take the maxillary index Mac^jMxx and the 
malar index MlzjMlx on the right side of the skull; with the relatively high corre- 
lation — *1217 ±*0290, the prediction formula of the latter from the former is by the 
values in Tables IV and XI: 

Mean^^^^ for array Mean population — *1217 x 83- 1522^ 

In other words, if the maxillary index be raised by a unit above its mean value, 
the malar index would be lowered only by *1320 of a unit, while the standard error 
of a determination from the above formula is 5*8055, or is only some *045 less than 
the standard deviation of the entire population! 

Our general conclusion must be that contiguity does infiuence to a small extent 
the indices of shape of the cranial bones. But the influence is so small and the 
factors determining it so little at present investigated, that very little error would 
be introduced by supposing for the time being that all these indices whether of 
contiguous or non-con tiguo us bones are independent. 

The striking feature of the investigation up to this point is the intensity of the 
homology factor as compared with the adjacency factor. 

If, notwithstanding these general considerations, the reader desires a reasonable 
scheme to represent the whole distribution of contiguous bone indices, we may 

* Tho retider muBt bear in mind a certain apparent paradox: Two adjacent bones will normally 
always “fit,*’ and it might be considered that this indicated a high correlation of shapes. But this is 
not so, the shapes of the two bones follow no general law. Thus, if we take the correlation of the 
parietal and occipital indices, PJPj and =- ’0.540 -0244, which cannot l>e considered 

individually as certainly significant, the mean of the group of occipital indices corresponding to a 
given parietal index will scarcely differ at all from the mean occipital index value ; thus there is no 
sensible association of one index with a second. 



Karl Pearson and T. L. Woo 


447 


consider whether treating all the correlations as of one sign, but with their individual 
standard errors, they could be reasonably represented by a random sample from 
a population with correlation equal to the average numerical value *0655. The 
computing labour is somewhat lengthy, and since the standard errors are nearly 
equal might reasonably have been lightened by treating them as the same, and 
equal to their mean, but to avoid a not unusual type of criticism, they have been 
kept distinct. The numbers on which the correlations are based (see Tables I and 
III) are very considerable, and the correlations themselves are small and many 
insignificant; we have thus no hesitation in treating their distribution as normal. 
The Pxn li^st being used, the probability integrals and their logai;ithms were 
obtained for the two opposite directions, with the following results : 


Probability Integrals: From left to right 
logio Xn == - 22*683,6667, 

JTi loge 10 - 755 log, 1 0 - 2*880,7838, 
u = 7*87413, 


From right to left 
logio X'n = - 22*639,2892, 
= 2*880,7838, 
u' = 7*85873, 


J(43, 7*87413), 


= /(43, 7*85873). 


Since n is greater than 30, Miss F. N. David's Table* cannot be used, and resort 
must be had to the Tables of the Tncomplete V-Function’\‘. Hence we find: 

P,^ = -8888, Px',,«*8863, 

Qa„«1112, Qa„=1127. 


There is scarcely any difference between the two sets, or if the parent populations 
consisted of a series of surfaces of correlation p== 0655 we should obtain, once in nine 
random samples, a sample more improbable than the observed one. This docs not 
justify us in asserting that the 44 index correlations of contiguous bones were so 
obtained, but it does justify us in stating that a moderately reasonable graduation of 
the observed results may be summed up in random sampling from a parent population 
of correlation coefficients with mean p = *0655. And this gives far greater definition 
to the system than the simple statement that the mean of the correlations is *0655. 


(10) Cross-Corr elutions. Groups (iv) and (v). 

Some remarkable results flow from these cross-correlations, indicating how great 
is the influence of homology and how small that of contiguity. We divide our 
indices into two classes, namely : 

A. The correlation of an index of one bone with the index of a second bone on 
the opposite side contiguous with the homologue of the first bone. The values of 
such correlations are given in Table XIII, p. 448. Out of 66 possible values, owing 
to the great labour already spent in making correlation tables and working them 
out (see p. 433), only 22 values were determined. 

* Bimuetrika^ Vol. xxvr. 1984, pp. 1 — 11. Miss David gives for logio 
t Reissue Diometrika Office, University College, London, price 42». net. 
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TABLE XIII. 

Cro88-Coirrelations. A. Index of a Bone with the Index of a Second Bone on the 
other side contiguous with the homologue of the First^, 


Index Pair 

Correlation 
and p.e. 

Index Pair 

Correlation 
and p.e. 


S^/S,{L) 

+ *15208+ 0240 

PJPx 

Pi/ndP) 



^ 2/^3 

Mx^lMxx {R 

Ss/S^(R) 

+ •1139+ *0243 


Ch/ihdP) 

— 

), AflJMli (L) 

- -0905 + *0290 

Pi! P i (P)x 

(h/o^dP) 

— 


- -1185 + *0293 

p-iiPi iP), 

OdO, dl) 

— 

P^fPUR), 

Os/0^{L) 

-•1142 ±*0242 

P 2 IP i {P\ 

OdO,dP) 

— 


On/O^dR) 

~ *0809 ±*0245 

PnlP^{R), 

OJOj(L) 

— 

PsfPi iR)x 


+ •0784 ±*0243 

P-l/P*dP)x 

(h/OrdP) 


Puip^iL), 

+ •0999 ±*0246 


OdO, dP) 

— 


OJOo(L) 

+ -0823 ± -0233 


OdOjdP) 

— 

PilP’A {R)'i 

O^/O^dR) 

+ -0757 ± *0233 

TjndP\ 

0d07dP) 


P-JPi {R). 

Ss/St (L) 

- -0700+ -0240 

n/n(/^), 

OdOAP) 


-V-S4(/0, 

Mx.JMxidL) 

- -0095 ± -0243 

- *0609 ± -0301 

(^)j 

n/n (/>), 

ojo„(/.) 

O^IO„{R) 

— 

SrJS^ (Lh 

Mxttf Mx\ {R) 

— 

nin w. 

Afl/A\ dP) 

— 

F^jFx 

Pi/PiiQ 

+ •0649 ±’0242 

PdP7dP) 


P 2 IP 1 <R) 

— 

TJT,dR)x 

Ao/A7(/.) 

— 


Mx:,IMx\dP) 

— 

TJTt(L), 

(/£) 

— 


Mx^iMx^dP) 

•0427 ± *0298 

Te/T,dP\ 

A 6 /A 4 dP) 



dL) 

+ *0211 ±-0260 

To/ndP)x 

A,/A4 dP) 



Mh/Mh (/?) 

+ *0426 + -02.68 

avAtC/O, 

Od Oq dP) 



Oy^/O^dl) 

- -0270 ±*0248 

Sa/S, (i). 

OdO^ dP) 

— 

SrJS.iD, 

0^/0^ dP) 

- -0075 ± *0245 

Ao/A? dR)x 

OdOr dP) 

— 

lilFi 

PJPidl) 

+ *0267 ± -0242 

AVAv(Z), 

OdO, dP) 

- 

Pa/Pa dP) 

— 

R^/S^dP), 

OdOjdP) 

— 

F2lFx (/O, 

MLilMh dl) 

+ •0205 ±-0243 

AV,/A4 dP)-> 

OdO, dP) 

— 


Mh/Mli dR) 

+ -0235 + -0244 

SrJSidP)x 

MxdMxdP) 

— 

fM (P)y 

Pi/ Pi (X), 

S 11 /P 7 dl) 

dP) 

T^/T^dR) 

— 

RrJR^dP). 

SjS\dP)x 

MxdMxidlf) 
Mh/Mh dP) 

— 

P 2 /P 1 {R\ 

Pi/Pi (^)» 

— 

AVAt (/-), 
AVArC^), 

MldMly dP) 
MxdMxx ( L) 

— 

P'aIPa{R)x 

TJT^dL) 

— 

AVAt (Z), 

MX 2 IMX 1 ( R) 

— 

PzIP^ {I)x 

n/n(/o 

T^/T^i dP) 


Mxs/MxidR)x MLJMlidL) 

— 

Pi! Pi (A), 


Afxs/Mx^ (A), MI 2 /MI 1 dR) 



B. The correlation of an index on one bone with the index of a bone on the 
opposite side non-con tiguous with the homologue of the first bone. The values of 
such correlations are given in Table XIV. In the same table arc. given the slightly 
contiguous values. In the first case there are 14 out of 40 correlations, and in the 
second case only 6 out of IG. If these be pooled there will be 20 out of 56. For 
the purpose of comparison we give some results in Table XV below. 

These results are noteworthy, they indicate that for the purely numerical values, 
which are really the more important ones (see p. 443), the cross-correlations do not 
fall short of the direct correlations either for contiguous or non-contiguous pairs ; 

* As in previous tables the correlations which have not been worked out are indicated. Had time and 
energy sufficed for working out the whole series, we do not believe that the results would have differed 
much from those of our practically random sample. 
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TABLE XIV. 

Cro88-Ooirrelation8, B. Index of a Bone with the Index of a Second Bone on the other 
eide non -contiguous or only slightly contiguous with the homologue of the First, 


Index Pair 


0 , 10 ^ (Ji), 
0 ^ 10 , (Z), 

P^IPi W, 
P3lP*(P), 
P 3 /P, (.1), 

F,lf\ (L), 
F,IF,iR), 
F2/i\ {L\ 
O^IOoiR), 

Os/Oo{L\ 

F 2 /F, (A), 
F^IF, (R), 
F 2 /F 1 (Z), 

/VA(^), 
A/ A (A), 
FJFi (/O, 
P 2 IP 1 (A), 
P2/P1 
P2/P1 (A), 
P^IPa{R). 
P 2 IP, (A), 
T2ITh{P\ 
A/ A (A), 
Tjn(R\ 


(A) 

MX 2 IMX 1 (R) 
Mx^l Mxi (Z) 
Mx^JMxi (M) 
MlilMlj (Z) 
Ml^/Mh {R) 
Mx^fMxx {L) 
Mx^jMxi (A) 
A/ A (Z) 
T2frs(R) 
OJOoW 
OslO^iR) 

(Z) 

Ml.JMl, (R) 
TJT,{L) 

n/A(/e) 

(Z; 
O^/OAP) 
MxJMxi (Z) 
MX 2 IMX 1 (R) 
MxqIMxi (Z) 
Mx^lMxy (R) 
MUMl^ (Z) 
MI 2 IMI 1 (R) 
Mx^lMxi (Z) 
Mxs/Mxi (R) 
Mx^fMx^ (Z) 
Mx.JMxi (/£) 
MX 2 IMX 1 \Ij) 


Correlation 
and p.e. 


+ -0910± 
+ *0900 ± 
-h*0669± 
+ -0801 ± 

- -0246 ± 

- -0635* 
-f-0693± 
+ -0544 ± 
-•0515 + 
-•0447± 
-f’ 0157± 
+ -0416± 
-•0076± 

- *0382 ± 


0286 

0283 
0298 
0308 
0268 
0261 

0284 
0283 
0229 
0231 

0231 

0232 
0248 
0248 


Index Pair 


PafPb (P')t 
P 3 IP 3 

03lO,{S), 
0,10, (L), 
0,10, (H), 
0,10, (X), 
0 ,f 0 ,(R), 
0,10, (X), 
0,10, {R), 
0,10, (X), 


P,IP*(a), 

P,IP,(L), 

T,IT,{R), 

F,IK(R), 

zyr, (X), 

Pi/PiW, 
P,IP,(R), 
Pt/Pt W, 

P,IP*(P), 
PalPtiJ^), 
Pi/PiW, 
PalP, (L), 
T,IT,{R), 
r,/T,{Z), 


MX 2 IMX 1 {R) 
Mx^jMx^ (Z) 
Mx^JMx^ (R) 
MxslMxi (Z) 
J/’^g/J/x, (R) 
JMX 2 } (Z) 
Mx2lMxx {R) 
Mx^f Mxx (A) 
Mx^jMxy {R) 
Mt2lAfli (L) 
MI 2 IBI 1 (R) 

Slightly 

(A) 
Sb/S.CP) 
MI 2 /MI, (L) 
Ml^lMh {R) 
JfX2l^^i (A) 
Mx2iMx^ (A) 
Aix^lMxi IL) 
Mx^jMx, {R) 

P 6 IS 7 (a5 
P^/PiiR) 

(A) 

Se/S^ (R) 
SbISaU) 

Pb/SUR) 
MI 2 IMI 1 (A) 
(R) 


Correlation 
and p.e. 


contiguous 
+ •0722 ±*0250 
4- 0631 ± 0256 
+ •0063 ± 0246 
+ *0328 ±-0246 
+ •0196 ±*0282 
+ •0323 ±-0287 


TABLE XV. Means of Direct and Gross-Correlations of Indices. 



Direct Correlations Means 

Cross-Correlations Means 


Arithmetical 

Algebraical 

Arithmetical 

Algebraical 

Contiguous 

•0666 

- -0164 

•0656 

+ -0035 

N on -con tiguous 

•0397 

+ •0197 

•0478* 

i 

+ -0248+ 


the only real difiference occurs in the algebraical values of the contiguous pairs; even 
here the average algebraic correlation of the cross pairs is higher than for the direct 
pairs. The correlations are all sinall but many of them must be considered as 

* This is -0521 without the “slightly contiguous.” 
t + *0192 without the “slightly contiguous. ” 
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significant. The result is a striking instance of the power of homology, the correla- 
tion of the index of a bone with that of a second bone on its own side is on the 
average not more intense than it is with the homologue of the second bone on the 
opposite side. A comparison of the individual correlations in Table XIII with the 
corresponding correlations in Table XI shows to what extent this is true for individual 
piiirs of contiguous bones, and a like comparison of Tables IX and XIV indicates the 
character of the relationship for non -contiguous individual bones. The reader will 
find some discrepancies where the difference is very considerable compared to the 
probable error of the difference. Thus compare 

F^IFt(R\ Pi/Pl(/^)=~•0992 with F^IFi(R). Pa/Pi (L) = -h 0549. 
Further compare 

T^jlMR), 08 / 09 (P) = + *0654 and Og/Og (X) - + ‘0604 

with T^ITziL), 08 / 09 (ft)= + -0757 and T^jT^^R), O 8 /O 9 (X)- + *0823, 

both cross-correlations being greater than both direct correlations. Again we may 
compare 

F^IF^{R), P 8 /P 4 (P) = + 0689 and F^IFxiL), P 8 /P 4 (X) « -4- *0032 

with F^jF-y P 8 /P 4 (X) = + *0267 j 

and once more 

Pf/Pi (P), MhjMly (P) = + *0368 and P 2 /P 1 (X), M^Mly (X) = + 0146 
with F^/FiiRl 0205 and F^/FyiL), (P) = + *0235. 

Many of these results can be attributed to the random sampling, for the correlations 
are so small as to be of the same order as their probable errors. While it is possible 
that if the correlations had been worked out for the same pairs of bones in each 
ctise, the bones on the same side contiguous and noii-contiguous might have shown 
on the average higher correlations than with the corresponding bones on the opposite 
side, the differences if significant are so small that we cannot assert that they exist 
when a random selection of indices has been worked out for each category and we 
base our conclusions on these. We can, however, assert that the factor of homology 
is so induential that with the small correlations which exist between the indices of 
non-horaologous bones, we cannot prove on our data that the correlation of the 
index of a bone A on one side with the index of a bone P, contiguous or non- 
contiguous, on the same side is definitely greater than the correlation of the index 
of A with the index of the bone B' on the opposite side, where B' is the homologue 
of P. 

(11) Correlation of Indices with Laterality and without Laterality, 

We define an index with laterality as one based on the ratio of two lengths both 
taken on the same bone, that bone being situated on the right or left side of the 
skull. An index without laterality is based on the ratio of two lengths neither of 
which is peculiar to one side of the skull, but both are measured on the same bone. 
A list of these non-bilateral lengths is given on p. 427. We have confined our 



Karl Pbarson and T. L. Woo 


451 


attention to five such non-lateral indices, namely two on the frontal, FzjF^, Fs/F^, 
and three on the occipital, O 4 /O 1 , 0^10^ and 0^1(0% + Os). We will deal first with the 
non-contiguous, or only slightly contiguous values, our Group (x). Thu correlations 
are given in Table XVI. The mean value independent of sign for the non -con- 
tiguous correlations is ’0672, and for the slightly contiguous group *0467 we have 
no hesitation in pooling the two series, for we should anticipate that the latter 
would have been the higher. The two series combined give a mean numerical value 
of ‘0604. This is somewhat greater than the mean numerical value of the correlations 
for the contiguous bones in Table XVII (p. 452), i.e. *0538. No clearer confirmation 
could be found of the statement that contiguity has little to do with the mean 

TABLE XVL 


Correlation of Indices on non^contigvons Bones, one Index of the pair 
having no Laterality, 


Index Pair 

Correlaticiii 
and p.e. 

Index Pair 

Correlation 
and p.e. 


T2lT:,(R) 

-f‘ 0893±*0229 

F,/F,, 

OslO^il) 



F 1 , 


— 

fnlF4, 

OsfO^(R) 

+ •0591 ±-0231 

Obi ( ^2 + ^y)» 

(li) 

+ •0754 ±-0285 

FslF^, 

Oh/Op (A) 

— 

CKI(.(h-¥0^, 

Af.V2lMx^ {L) 

n/n w 

Tjn (A) 
Tjmn) 

+ -0644 ± *9231 

f\IP’4, 

Slightly uoiitiguouw 
Mx,JMxi {R) 1 + *0571 ± -0281 


+ *0636 ± *0231 


{L) 

— 



— 


Jhfx^jMxi {R) 

- -0626 ± 


OJO^ {It) 

+ •0629 ±'0230 


Mx^lMx^ {L) 

— 


ihKh ^L) 

— 

K/F4^ 

{R) 

-•0401 ±-0306 

ojo,. 

{R) 

-*(X>19±*a311 

^41^4, 

Mx.il Mx^ (L) 

— 

OJO,, 

Af.r j ( 7/ ) 

— 


Mx2lMxi (R) 

+ •0368+ -0281 

F -j / 4 , 

(hlOy{R) 

+ •0612 ±-0231 


MX 2 IMX 1 {L) 

— 


values found for the series in Tables XIII and XIV. We have to admit that our 
correlations are far from an exhaustive list of possible correlations, which might be 
worked out, but those chosen have not been in any way selected to give high or low 
values of the correlations.* If there be really any sensible difference between the non- 
contiguous pairs in Table XVI, and the contiguous pairs in Table XVII, we can 
hardly discover it on the values worked out thus far. 

(12) Con'elation of Indices on Contiguous Bones, one of the Indices correlated 
having no Lcuterality, 

Of the 28 indices falling into Table XVII, no less than 19 are negative. In this 
the table differs markedly from Table XVI where out of 12 correlations 9 are positive. 
The negative correlations have a mean of — *0541 and the positive a mean of -P *0533. 
When we consider only the numerical value the mean correlation is, as we have 
stated in the last section, *0688. It is difficult however to lay stress on the near 
equality of the mean positive and negative correlations for we can change the sigfn 
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by inverting the ratio. Thus if we had taken for our index 100 O1/O4, which would 
have made the mean of that index under, instead of over, 100, we should have 
found PslP 4 ^(R) increasing with O1/O4, instead of decreasing as it does with O^Oi; 
similarly if we had taken our index 100(08 4- OsVOs, which would again have given 
a mean value under instead of over 100, we should have got the opposite sign for 
the index correlation (JR), (O2 4- OaVOs to that of Ss/S^(Ii) and 4- Os). 

In other words the signs of index correlations, as we have before stated, are at our 
choice, depending on which constituent of the ratio we assign to the numerator, 
which to the denominator. 


TABLE XVII. 


Correlation of Indices on Contiguous Bones^ one of the Indices correlated 

having no Laterality, 


Index Pair 

Correlation 
and p.e. 

Index Pair 

Correlation 
and p.e. 

Ps/1'4 (R\ 

OJO^ 

- -1290 ±-0241 

FrJF„ 

{R) 

- -0607 ± -0243 

(/>), 

6>4/Oi 

- - 


(Z) 

— 


<h,KO^ + 0:,) 

- *1154 ±'0245 

F ilF 1 , 
F^IF„ 

>SV*i>V (/0 

+ -0439 ± 0237 

(L), 

05((02-\- 0:i) 


S^IS,{L) 

— 

K. 1 F 4 , 

Ps/PiW 

+ '1103± *0239 

.%/-»,( /i). 

0,10, 

- -0.361 ± -0241 


/V /^4 {L) 

- 

SJS-! {L), 

0,10^ 

— 


OJOs 

- '1053+ *0244 



- -0252 ± -023.5 


Oe/Os 

— 

PfifPh (Z/’X 


— 



- -0807 ± -0243 



-'0247 ±-0234 


(Zr) 

— 



— 


OJO, 

- *0786+ '0247 

PsIPa. 

MlJMl, (R) 

- -02.39 ± -0244 



— 

F j // 4 , 

MlJMl, (Z) 


n/^4, 

^V*5>4(/0 

-•0764+ '0243 

o,Kh 

+ 0231 + -0235 


S^fS^L) 


T2lT^ (P\ 

0,10, 

— 

FJF,, 

PJPii/t) 

- '0733 + '0242 

F~JPi (R), 

Oe/O^ 

+ -0227 ± -0245 


PJPiW 

— 

PJPi (ZX 

On/0.j 



P 2 IP 1 w 

- '0726 ± '0240 


0^1 {O 2 + Os) 

-•0220 +'0236 


P 2 IPI (P) 

— 


0^f{p2-¥ Os) 

— 


+ '0707 ± -0246 


0,J{02 + 0s) 

-•0121 + -0236 

PziP^ (^), 


— 

T 2 IT 2 (Z), 

Ojeps + Os) 

, ™ 

P 3 /P 4 , 

PsIPAiP) 

+ •0704 +0241 

PbfPA. 

Sa/Sr (R) 

+ •0113 ±-0237 

P:i/P4’» 

PhIPaW 

^ - 

PrJPA. 

Ss/Sj (Z) 

P2lPl(P\ 

OJOr 

- -0663 + -0244 

PJPlip). 

0rJ(02 + 0s) 

- -0089 ± -0246 

P'JP\ 

O 4 / 0 , 

— 

A//^ (ZX 

0^1(02+ Os) 

— 

P^IP^ (^)> 

Og/ O 3 

+ •0f>62 ± 0244 

^6/^7 (/iX 

0bl{02 + 0s) 

- -0087 ± -02.39 

P^IP\ (^X 

OJOs 

— 

(Z), 

ObK02-¥Os) 

. 

n/n(/o, 

OJOi 

+ *0609 ± -0235 

^oAS^7(Z?), 

Of^lOs 

- 0066 ± -0238 

7^0/n 

OJOx 

\ 


^6/^7 (ZX 

OelOs 

— 


As it does not seem possible to lay any stress on contiguity as differentiating the 
correlations given in Tables XVI and XVII, we will pool the 40 correlations, which 
then provide a mean numerical correlation of *0558, and simply ask whether these 
40 correlations of lateral with non*lateral indices may be taken as a random sample 
from parent populations with a correlation coefficient of r = *0558, This inquiry, 
if very laborious, will enable us at any rate to say whether it is reasonable to 
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represent by a single value of »• the influence of non-lateral indices on lateral indices. 
The results are as follows: 


Probability Integrals: From left to right From right to left 

log X« - - 1.5-894,7938, log V„ = - 16 017,2577, 

Jn login e = 2-746,71958, Vre logm e = 2 746, 71958, 

V = 6-786,828, u' = - = 5-831,4136, 

V n logic e yjn logic e 

u) = /(39. 5-786.828), 24')=/(39, 5-831,4135). 


Hence from the Incomplete T-Function Tables: 

PAn = -3083, Pvn- *3251. 

Q^^ = *6917, «a'„ = *6749. 

The latter is somewhat the more stringent. It indicates that in 100 random 
samples from the population of correlation coefficients with mean -0558, 67 would 
give a more improbable result than our system of correlations of indices of lateral 
and non-lateral bones. We do not assert that our system is such a selection, but 
we do assert that it is a highly reasonable description of what has been observed, 
that is to say a successful graduation. In other words the relation of lateral and 
non-lateral indices can be expressed by the single number *0558. 


(13) Correlation of two Indices ofihs same Bone. 

This is our Group (vi) and the correlations falling under this heading are given 
in Table XVIII below: 


TABLE XVIII. Correlation of two Indices of the same Bone. 


Index Pair 


(^*i) 

(“2) 

(5) 

(^) 

W 

{ck) 

(«i) 

(/ 2 ) 


O4/O1, 

OJOu 

niK, 

Oal{0.i + O.S), 

ojo„ 
0,1 o„ 
o«/o«(fl), 

{I\ 


{l\ 

(, ‘ 

(- 


(a,) Mx^iMx^ {H), 


(t*2) 

(/fO Fjf,, 


O^/ChiL) 

F^IF, 

o^jCh 

0,l0u (R) 

(h/O^ (L) 

(hf07(R) 

F,/F, 

OhIO,(Z) 

F,IF, 

(L) 

MxJMxi {R) 
FJFAZ) 

F^lF.iR) 


Correlation 
and p.e. 

Index Pair 

Correlation 
and p.e. 

+ •6672+ -0129 

(ii) 


^%/>S,(R) 

+ -1633+ -0247 

+ •6042 +*0147 

( 7 * 1 ) 

fjP4(R), 

FJPi(R) 

- 1133+ -0239 

+ *6066 + -0168 

(^) 

0J(0^+0a), 

OJO, 

-•1109+ -0231 

+ •4764 + -0180 

(0 

0,1 o„ 

0^10, 

- -0894 + -0233 

+ •3744 + *0201 

(w,) 

(hJO^iR), 


- 0865 + -0232 

+ -3130±-0211 

( 72 ) 


A/ACi) 

-•0753± -0241 

+ •3094+ -0210 

Oh) 


^ 2 / P-A {R') 

+ -0679 ± -0233 

+ •2677 + -0210 

(Ol) 

O^HO^ + O^), 

OJO^iR) 

- -0667 + -0233 

+ •2573+ -0216 

(wijj) 

OslO^iL), 

OalO.^ 

- 0584 ± -0233 

+ •2566 ±-0212 

(Pa) 

Oe/On. 

0J0,{L) 

- -0549 ± -0232 

+ •2284 ±-0303 

(^ 2 ) 

mniD, 


+ -0460 + *0233 

+ •2103+ -0311 

(?2) 


O^KhiL) 

+ -0416± -0233 

+ •2062 ±-021 8 

(yi) 

0al(02 + 0^), 

Os/OriR) 

+ 0327 ± 0233 

+ •1909+ -0242 

(Pi) 


(h/O, {R) 

- -0135 ±-0234 

+ -1786± -0220 



0,10,, (L) 

-•0032 ±*0234 


The small letters in the corners of Columns 1 and 3 indicate pairs where 
they exist, the subscript 1 for right and 2 for left. 
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It will be seen at oncje that indices on the same bone present a wide range of 
correlation values. But the source of these differences is fairly easy of analysis. 
Thus: 

(а) O 4 ~ 0% (H) 4- Og (i) and there will be a high positive correlation between 
Og (B) or Og (L) and O 4 if there be little between the other occipital measurements 
involved. 

( б ) F 4 is the denominator of both indices, hence if there be little correlation 
between the other frontal lengths (b) should show a considerable positive correlation. 

(c) Os is common to both index denominators and this should involve positive 
correlation in the indices. 

(d) The same condition holds as in (a) and involves positive correlation. 

(e) Os is the numerator of both indices and positive correlation follows. 

(D and (h) F 4 is nearly parallel to and so to speak covers it, so that Fi can 
be almost looked upon as a part of ^ 4 , and positive index correlation may be 
anticipated. 

{g) Mx\ is the common denominator of both fractions, and we may anticipate 
positive correlation. 

(i) S 4 overlies ^ 87 , or the latter may be looked upon as part of the former; 
positive correlation will result. 

(j) Fs and Pi probably possess considerable correlation, so that a negative 
correlation would arise between the (j) indices. 

(k) and (1) Oi is likely to have a negative correlation with both Og and O3, thus 
productive of a negative correlation in the (A.-) and (/) indices. 

(m) to (q) In the remaining indices one does not see any special reason for their 
magnitude or sign. 

From the slender material we have, we may reason that this system of correla- 
tions of indices of the same bone is more or less capable of interpretation, if a high 
correlation arises when the two indices have a common factor, a somewhat lower corre- 
lation when a factor of one overlaps a factor of the other, and little or no correlation 
when neither of these cases can be seen to hold. Until the correlations of absolute 
lengths on the same bone have been worked out, it is not possible to ascertain the 
limits of the truth of this statement. But we may consider whether the 10 correla- 
tions in the m q group could be reasonably considered as a random selection from 
a parent population of zero correlation. The values of the correlations in these 
groups are : 

- 0805 ± -0232, + -0460 ± ‘0233, 

+ *067 9 ± *0233, + -0416 ± *0233, 

- *0667 ± -0233, 4 - *0327 ± 0233, 

- 0584 4 *0233, - -OISS 4 0234, 

~ *0549 4 0232, - *0032 ± *0284. 

Mean of the six negative correlations = — ’0472, 

Mean of the four positive correlations « *0471. 
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The probable error can be taken as *0233 for the system, or the standard error 
as *034646. Working out the probability integrals and their logarithms, we have: 

logic Xt* = - 7*358,4735, logjo X'n = - 4*868,7097, 

Jn logio e * JTO X *434,2945 « 1*378,3598, 
u = 5*358,0086, w' « 3*546.1086, 

Pa,, = /(9, 5*358,0086) = *9731, Pa ,, = 7(9, 3*545,1086) = *6818, 
Qa^»*0269, Qa,, = *3182. 

Notwithstanding the value of which would warrant the “ reasonableness ” 
of the hypothesis, we feel bound to take the more stringent for our guide, or 
more probable distributions of the correlation values would occur in 97 % of 
random samples than in the sample actually observed if they were taken from 
a population of zero correlation. 

Now there are those who accept a < *01 probability as justifying the rejection 
of an hypothesis; others demand a< *05 probability. We hold that in each case 
the actual probability should be determined, and then judgment formed in relation 
to the material on which it is based. In our case, we do not think that Q^n ~ ‘03 
about, justifies us in accepting the hypothesis that, if we exclude cases of common 
factors in the indices and overlapping,” there is little or no correlation of indices 
on the same bone. Wo prefer to wait until the correlations of absolute lengths on 
the same bone have been measured, and we can trace the contribution of estch 
absolute length to the make-up of the index correlations. 

What we feel confidence in asserting is that common factors, and, to a less 
extent, “overlaps,” do increase the numerical magnitude of the correlation of 
indices of the same bone. 

(14) Influence of Homology on Indices taken on like Bones, 

This section will deal with our Group (viii). Unfortunately, the number of 
correlations falling under this heading and already worked out are few. They are 
given in Table XIX below. 

In this table, beside the correlations of heterologous indices on homologous 
bones, we have the correlations of heterologous indices on the same bone, and 
further the homologous indices of homologous bones. It would be of great 
value were we able to deduce approximate values for the correlations in Column 2 
of Table XIX from the data given in Columns 4, 6 and 8, that is to say if we 
could deduce the cross-correlations from our knowledge of direct correlations and 
the correlations for homology. 

Formulae, having at least some theoretical basis, have been suggested with 
this end in view. Let A and B be two characters, and the subscripts R and L 
refer to the right and left sides of the skull, then we will test the following 
relations : 

(£). 


^ i ^ ‘‘BlBr + rARBK ^ 



Correi Hm of Heterologous Indices rni a pa\ of Homologous Bones. 
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The observed values are set against the values deduced from the above formulae 
in the following table ; 

TABLE XX. Observed Cross-Correlations compared with Values computed from (e). 


A B 

Observed 

Computed 

Remarks 

Sc/SA/t), S^/St{L) 

n/Tu/t), T^/mz) 

PJPUR), Pi/Pi(Z) 

P,/Pt(.Z), P^jPxiR) 

O^IO„(R), (hlOliZ) 

0^/0^ {L), OJO,(R) 

Mx^fMx^ (Z), Mx^jMxi (R) 

-4- *1787 ± *0244 
+ •1863 ± 0245 
+ -0322 ± 0234 
+ •0416 ±*0234 

-•1169+ -0245 
-•1756 ±*0240 

+ •0700 ±-0231 
+ •0584 ±-0231 

+ •1867 ±*0316 
+ •1757 ±-0324 

1 

+ •1584 f 
+ -1581 1 
+ •0412 1 

+ -0392 \ 

- *0600 
- 0525 

+ •16486 
+ •1515 

+ •1892 / 
+ •1884 1 

Observed in excess, but compatible 
considering the probable errora 

Observed and computed, quite 
compatible 

Observed in excess of computed, 
and differences pointing towards 
significsauce 

Observed in defect of computed, 
and differences ix>inting towards 
significance 

Observed in defect, but differences 
so slight as to render the results 
compatible 


The differences between the observed and computed correlations run first one 
way and then the other. The worst cases are those for the parietal and the 
occipital indices, but while for the former the observed are considerably in excess, 
in the latter the observed are considerably in defect. The results are on the 
whole not encouraging for the value of the formulae (e). 

The formulae cannot, however, be at once dismissed as not even approximate. 
It seemed worth while testing the formulae (e) on a wider field. Table XXI is 
accordingly given here, although it belongs to another section of our work* on 
morphometric characters, namely, to that dealing with the correlations of hetero- 
logous absolute lengths. 

Table XXI gives 120 correlations of absolute lengths, enough material to test 
formulae (e) on 56 cases of absolute measurements. We do not intend in the 
present paper to analyse this material, but we may draw the reader's attention to 
one or two points having a bearing on our present inquiry. 

Putting aside the Homologous Correlations, there are only four out of 28 double 
cases, where either direct or cross-correlation exceeds *35. These are F% (about 
*64), Ai, P 2 (about *54), PJj, ^2 (about *46) and Tg, (about *37). On examination 
of our diagrams (pp. 425,426) the reader will see some reason for these relatively high 
correlations. But he will probably be surprised to see how low the correlations on 
the whole are; out of 112 non-homologous correlations, 96 are below *25 in value! 
It follows that the factor of size is not nearly as important as that of homology in 
the relationship of different parts of the skull. 'J'^hc last two columns of Table XXI 

* Prom the 63 original measurements, nearly 2000 correlation tables could be formed — ample 
material for a further section of this paper on correlation of absolute lengths of individual bones of the 
skull. Only some 120 are given in Table XXI. 




TABLE XXL Correlation Coeficients of Absolute Measurements. 
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show us that the lowest homologous correlation is *6738, and they fill the range 
from that value up to *9766. But the highest absolute size non-homologous cor- 
relation is *6449, and they range from this numerically down to *0091. The only 
certain negative correlation is that between the parietal arc Pa from bregma to 
sphenion, and the sphenoidal chord from krotaphion to the postreme point of the 
sphenoid as exposed on the base of the skull. 

Thus it would seem that the factor of size as judged by the correlation of 
absolute lengths of the cranial bones, while greater than the factor of contiguity, 
is still very much less intense than the factor of homology. 

While it may be shown from Columns 3 and 4 of Table XXI that correlations 
of absolute lengths on the left side, do not as a whole differ significantly 

from those of absolute length on the right side, yet the cross- correlations 

and smaller, but while giving considerable differences (see 

Table XXII, Columns 3 and 4), they arc yet again within the limits of random 
sampling. As the cross-correlations are less than the direct correlations, we are 
justified in seeing what values formulae (e) will provide for the cross-correlations. 
The computed values are given in Column 5 of Table XXII. They show at once 
that the differences of Column 7 are wholly trivial and thus 

unlike the differences in Column 4. Thus, whether the calculated cross-correlations 
do or do not equal the observed values, the numerical values of 

®'l^d ^ {VAlBr ^ ^'BbBr "b 'f'BjiAji ^ ^'ArAj^ 

are for practical purposes equal. 

On the other hand, if we compare the means of the observed cross-correlations 
^arBb ^i^d 't'alBr with the means of the corresponding values computed by (e), 
the latter are in all the 28 cases smaller with the one exception of and Mxxy 
where (e) gives a value larger by *0026. 

It is clear, therefore, that whatever the index correlations may do, (e) does not 
represent a general principle applying also to the correlations of absolute lengths. 
If Otn he an observed cross-correlation mean, i.e. Ol,r), and C^n. be a 

computed cross-correlation mean, i.e. J ((7/2 i "b we have calculated the 

linear relation of to (7,^ as given by Least Squares and we find the equation 

On. - 1*098,138 C,n + *005,998 (e'), 

where the constant term may be taken as *0060. 

The results arc exhibited in Table XXIII. 

It will be seen that the mean numerical difference between the observed and 
computed means is (*0094), and this is less than half the difference between the 
observed Hence, if we suppose the difference between the iJ, L 

and Ly R values of the cross-correlations to be merely due to random sampling, not 
to inherent right- and left-sidedness, it follows that formulae (e') will provide, well 
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TABLE XXII. 

Difference of Observed and Computed Gross-Correlations of Absolute Lengths. 


Index Pair 


Observed 

Correlations 


Mean 

R,L and L, 12 


Difierenoe 


Computed 
Correlations (c) 


Mean 

i2,L andL, R 


Difference 

R,L-L,R 


FiW, 
F, {/i), 
F,W. 
Fi{R), 
F^W, 
F{R), 

Fi (L), 
Fxi/l), 
Fi W, 

FAR), 
Fx [L), 
FxOt), 

FAR), 

fad, 

FAR), 

FAD, 

FAR), 

fad, 

FAR), 
Fi^D, 
FAR), 
FAD, 
FAR), 
FAD, 
PAR), 
PAD, 
PAR). 
PAD, 
PAR), 
PAD, 
PiiR), 
P 2 iD, 
PAR), 
PAD, 

0x(«), 

Oj (D, 
OAR), 
OAD, 
(h{R), 
OAD, 
OAR), 
OAD, 
TAR), 
TAD, 
TAR), 

tad, 

TAR), 

TAD, 

SAR), 

SAD, 

SAR), 

SAD, 

ith{R), 

Mtx^D, 


fad 

FAR) 

PAD 

PAR) 

OAD 

OAR) 

TAD 

TAR) 

sad 

SAR) 
Ml, [L) 
Mil {R) 
Mxx ID 
Mx, (R) 

PAD 

PAR) 

OAD 

OAR) 

tad 

TAR) 

sad 
saR) 
MlAD 
Ml, (R) 
Mxi (L) 
MxiiR) 

OAD 
oaR) 
TAD 
TAR) 
<S'r. {D 
SAR) 
Mh (D 
Mix {R) 
Mxx {D 

Mxx(R) 

TAD 

TAR) 

sad 

SAR) 

Mix (D 

Mli(R) 

Mxi(L) 

Mxx (R) 

SAD 

SAR) 

Ml, (L) 

Mli(R) 

Mxx (i) 

MxxlR) 

Ml, (D 
Mix (R) 
Mx, (L) 
Mx, (R) 
Mx, (L) 
Mxx(,R) 


•6314 ±-0136 
•6240 ± -0138 
•5095 + -0178 
•4885+ •0182 
•1724 ±-0222 
•1546+ -0224 
• 1808+ -0221 
•1830 + -0222 
•1020±-0231 
•1964±-0234 
•1619+ •0236 
•1438 ±-0238 
•2023 ± •0241 
•2019 + -0239 
•4695 ±-0187 
•4306 ±-0196 
•11 46 + -0226 
•0778 + •0228 
•11 10 + -0226 
•1488 + -0225 


•0493 + 8)240 
•0678 ± -0243 
•0852 ± -0241 
•0663 + '0242 
•1202 ±-0247 
•0629 ±-0249 
•2266+ -0230 
•1928 + -0234 
-•0092 + -0241 
•0317 ±-0243 
- 01641 •0248 
-•0668 + -0252 
•0623 ±-0263 
•0877 ±-0254 
•1201 ± •0261 
•0981 + -0261 
•0606+ -0231 
•0843 ±-0230 
•0427 ±-0243 
•02961-0246 
•06371-0244 
•06471-0245 
•14501 -0248 
•1733 1-0244 
•35831-0211 
•3728 1-0211 
•09581-0243 
•11461-0241 
•09561-0251 
•12631 -0247 
•1657 1-0253 
•17721-0248 
•21601-0257 
•21501-0267 
•17761 -0266 
•17771 -0266 


} 

I 

} 

} 

1 


•6277 

+ •0074 

•4990 

+ •0210 

•1635 

+ •0178 

•1819 

-•0022 

•1942 

-•0044 

•1628 

+ •0181 

•2021 

+ •0004 

•4501 

+ •0389 

•0962 

+ •0368 

•1299 

- •0378 

•0686 

- -0185 

•0757 

+ •0189 

•0916 

+ •0573 

•2092 

+ •0328 

•0113 

- -0409 

- -0416 

- -0604 

•0750 

- *0254 

•1091 

+ •0220 

•0724 

- -0237 

•0361 

+ -0131 

•0542 

-•0010 

•1592 

-•0283 

•3656 

- -0145 

•1052 

- -0187 

•1109 

- -0308 

•1665 

-•0215 

•2150 

•0000 

•1776 

-•0001 


{ 


•5946 

•5945 

•4339 

•4329 

•1343 

•1353 

•1641 

•1653 

•1697 

•1711 

•1385 

•1386 

•1813 

•1816 

•3659 

•3628 

•0800 

•0800 

•1146 

•1157 

•0460 

•0469 

•0605 

•0606 

•0856 

•0854 

•1597 

•1606 

•0015 

•0000 

- -0401 

- -0395 
•0594 
•0592 
•0903 
•0916 
•0571 
•0576 
•0203 
•0203 
•0464 
•0459 
•1313 
•1315 
•3270 
•3265 
•0963 
•1000 
•1138 
•1132 
•1672 
•1604 
•1995 
•2036 
•1684 
•1681 


} 

} 


•6945 

+ •0001 

•4334 

+ •0010 

•1348 

- -0010 

•1647 

- -0012 

•1704 

-•0014 

•1385 

-•0001 

•1815 

-•0003 

•3643 

+ -0031 

•0800 

•0000 

•1152 

-•0011 

•0465 

- •OCXIO 

•0605 

- -0001 

•0855 

+ •0002 

•1601 

-•0008 

•0007 

+ •0015 

- -0398 

+ •0006 

•0593 

+ •0002 

•0909 

-•0012 

•0574 

-•0005 

•0203 

•0000 

•0461 

+ •0005 

•1314 

-•0002 

•3267 

+ -0005 

•0982 

- -0037 

•1135 

+ •0006 

•1688 

-•0032 

•2015 

-•0041 

•1683 

+ •0002 
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TABLE XXin. 

Comparison of Obseirved and Computed (by (e')) Cross-Correlations. 


Index Pair 

Observed 
Mean of 

Tv and X, R 

Difference between 
observed 

R, X and X, R 

Difference between 
observed Mean 
and Mean 
computed by («') 

Computed 

Mean 

R, X and X, R 
by (e') 


'6277 

4- -0074 

- -031 1 

•6588 

^ 1 , P 2 

•4990 

4- -0210 

+ *0121 

•4869 

Fx, Ot 

•1636 

-h-0178 

+ •0095 

•1540 

Fi, 

•1819 

-•0022 

- -0049 

•1868 

F\ y aS/, 

•1942 

-•0044 

+ •0011 

•1931 

Fx, Mix 

•1528 

4 - *0181 

-•0053 

•1581 

Fx , MiJCx 

•2021 

4 - -0004 

- •(X)32 

•2063 

^ 2 , A 

•4501 

4- *0389 

+ -0440 

•4061 

F^x 0 , 

•0962 

4 - -0368 

+ -0023 

•0939 

^ 2 , n 

•1299 

- -0378 

- -0026 

•1325 

F,, Sr, 

•0586 

- -0185 

+ •0015 

•0571 

F2x Mix 

•0767 

+ •0189 

+ -0033 

*0724 

F 2 , Mxx 

•0916 

+ 0573 

- -(Xia.*! 

•0999 

A, <h 

•2092 

+ •0328 

+ -0274 

•1818 

Ta 

•0113 

-•0409 

+ •0045 

•0068 

A, 

- *0416 

- 0504 

+ •(X)39 

- *0377 

p2x Mix 

•0750 

- -0254 

+ -0039 

•0711 

Pi, Mxx 

•1091 

+ -0220 

+ -(XiaB 

•1058 

0 -,, Ti 

•0724 

- -0237 

+ •0034 

•0690 

()•; , Sr, 

•0361 

+ -0131 

+ •0078 

•0283 

Ot, Mix 

•0542 

-•0010 

- -0024 

•0566 

Oj , Mxx 

•1592 

- 0283 

+ •0089 

•1503 

T 2 , s. 

•3656 

- 0145 

+ -0008 

*3648 

Ti, Mix 

•1052 

- -0187 

- -0080 

•1138 

Ti , Mxx 

, -1109 

- -0308 

- -0197 

•1306 

Sr„ Mix 

•1665 

- 0215 

-•0139 

•1804 

iSfi, Mxx 

•2150 

•0000 

-•0123 

•2273 

Mix, Mxx 

•1776 

-•0001 

- •oi;i2 

•1908 

Mfian Numerical Difference 

•0215 

•0094 

— 


within the limits of random sampling, the value of the cross-correlations in terms 
of direct and homologous correlations in the case of absolute lengths. 

General Conclusions. 

In considering the structure of the skull, we should a priori expect that the 
dimensions and shapes of the individual bones of which it is constructed would 
be controlled by certain factors, capable of being statistically investigated when 
appropriate measurements have been made on the individual bones. 

In the first place the skull is approximately symmetrical, having mirror sym- 
metry about the “median sagittal plane.” This approximate symmetry would 
appear to involve a correspondence in size and shape of homologous bones. We 
term the source of this the factor of homology, and measure it by the correlations 
between corresponding lengths and indices taken on the homologous bones. 
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Secondly, we note not only the approximate symmetry of a skull, but that one 
skull is larger than a second. We should anticipate that the size of one bone would 
be highly related with the size of a second. This leads us to a factor of size, which 
may be measured by the correlation of lengths taken on different bones on the 
same side, or on heterologous bones on opposite sides. 

Thirdly, we should consider that the bones of the skull fit one another or that 
there must be a relationship of shapes. This leads us to a factor of shape measured 
by the correlation between the indices of bones. 

Fourthly, we might anticipate that the shape of a cranial bone would be more 
influenced by an adjacent bone than by one not in contact with it. This leads us 
to a factor of contiguity. This is measured by the correlations between adjacent 
and non-adjacent bones. 

(i) It has been long a desire of statisticians to obtain statistical constants 
independent of size. For this purpose the coefficient of variation was introduced 
by one of the present writers as an attempt to get over the difficulty that a large 
length hiis usually a larger absolute variation. It is shown on the present data 
that the correlation between absolute size and absolute variation is on the whole 
not as large as might be anticipated, i.e. about -h*28*, while the correlation of 
absolute length and coefficient of variation is -- T2. In other words, wc reduce the 
relation of variation to size by more than a half by using the coefficient of variation, 
but it over corrects. The coefficient of variation varies rather inversely than directly 
as the size of the organ. 

(ii) The fact that coefficients of variation arc not equal even for two lengths 
measured on the same bone, led us to investigate whether variation was associated 
with the orientation of a measured length with regard to the fundamental planes 
of the skull. We divided up our data into broad classes, sagittal, transverse and 
vertical, with subclasses for lengths going in intermediate directions, and found 
that for coefficients of variation of absolute lengths there was least variation in the 
sagittal, most variation in the vertical, and an intermediate variation in the trans- 
verse direction. The mixed classes fitted well into this scheme. The same method 
was applied to the absolute variations (standard deviations), but the results were 
far less clear cut. 

It will be seen from this result that the laying down of a given bone of an 
individual skull does not proceed equally in all directions — bone growth is not 
isotropic. This may be due to the existence of more than one centre of ossification 
or to other causes at present not studied. 

(iii) Putting aside the like lengths on homologous bones, of the correlations 
of absolute lengths only 16 correlations out of 112 exceed *35, the remaining 96 
lie below *25. Those exceeding *35 are Ft with and P*, F% with and T% 
with S^, It is clear that the factor of size — except for these bones, frontal, parietal, 
temporal and sphenoid — is not an all-important factor in regulating the relationship 


Cf. Biometrika, Vol. xxn. p. 839, footnote f. 
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between individual bones of the skull. The mean numerical correlation of the 
whole 112 pairs is only *1751^. If this be compared with the mean correlation 
of the like absolute lengths On homologous bones, i.e. *8198*f, we see that the latter 
is 4'7 times the former, or the factor of homology is almost five times as intense as 
the factor of size. 

(iv) While lateral influence is found in absolute lengths (see p. 424, (ii)), it has 
no appreciable influence on either the magnitude or variability of the indices of 
either heterologous or homologous bones (see pp. 426, (iii); 435 — 436). 

(v) In the case of like characters of homologous bones, the factor of homology 
shows greater intensity on the facial bones, whether judged by indices or absolute 
lengths. The malar, maxillary and frontal bones have the highest correlation 
coefficients, or we may say that the face is more symmetrical than the brain-box. 
The sphenoid is exceptional. 

The mean index correlation for homologous bones is ‘7414, while that for 
absolute measurements of homologous bones is *8198. Thus the factor of homology 
is somewhat more powerful in the case of absolute lengths — in directing size — 
than in that of indices, i.e. governing shape. But in both cases the factor of 
homology is far more intense than that of any other of the factors we have 
considered. 

(vi) Confining ourselves to the relationship of indices of bones both op the 
same side, we endeavoured to measure the power of proximity in determining 
the correlation of indices. We divided our bones into non-contiguous, slightly 
contiguous, and contiguous groups. The numerical mean correlations were ‘0397, 
*0434 and *0655 respectively, showing increasing mean correlation with a greater 
contiguity. If w^c pay regard to sign, we have for the three groups : *0197, *0145 
and — *0164. Tfie former result indicates that we may pool the slightly contiguous 
with the non-contiguous, giving non-contiguous and contiguous groups tis *0407 
against *0655. The latter result is more or less meaningle.ss, owing to the idle 
character of sign in index correlations. We have shown in the course of the text 
that the whole system of contiguous bone indices might be reasonably looked upon 
as a sample from a population of correlation *0655. Thus the maximum influence 
of the factor of adjacency may be represented by *0655, as against the factor of 
homology *7414, which is more than eleven times as great. 

(vii) As far as shape — not size — is concerned, the magnitude of the homology 
factor is indicated by the fact that the cross-index correlations give numerically a 
mean of *0656 for contiguous bones, while the direct index correlations for the 
same bones give *0655, while for the non-ctmtiguous bones the values are *0456 

* The JLf Li correlations are somewhat greater than the 11 correlations, probably signiiicantly, but 
the difference is not of practical importance. The cross-correlations It, and X., It are sensibly equal 
but somewhat less than the direct correlations. The differences are again so small that we have a 
further illustration of the power of the factor of homology : see foot of Table XXI, p. 458. 
t Cf. Biometrika, Vol. xxii. p. 830, Tabic I. 
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and 'OSO? respectively. It is thus on the average indifferent whether we correlate 
the index of a bone A with that of a bone B, contiguous or not with it on the 
same side, or with the index of a bone B* homologous to B, 

Finally we can sum up as follows : 

Average intensity of factor of homology : 

For size '8198, for shape *7414, 

Average intensity of factor of size for non-homologous bones : 

For size *1761*. 

Average intensity of factor of contiguity at a maximum : 

For size-f- , for shape *0665. 

From these numbers it will be clear to the reader that the only factor con- 
trolling the shape and size of the skull as a whole worthy of real consideration is 
that of homology. Homologous bones must grow both in shape and size alike — we 
have no suggestion to make of why they do so, we only note the fact that they 
do so. On the other hand, the factor for size is of small intensity ; we cannot 
predict, except in a very few cases, the absolute size of one bone from another. 
Nor can we predict from the shape of an individual bone what will be the shape 
of a second, whether it be adjacent to it or not. It is as if bones spreading from 
their centres of ossification grew wholly individually, except that homologous bones 
grow alike. There is no fitting of adjacent bones to a standard pattern, no general 
control factor or factors (beyond that of homology) by which the skull as a whole 
is organised. There is no system of hereditary determinants guiding the shape 
and size of the cranial bones to predetermined forms. The ossification of the 
brain-box is merely a process of covering the brain with a bony surface, much as 
a plasterer covers a wall by dabs of plaster each independent of the size and shape 
of other dabs. Even the factor of homology may flow from the symmetry of the 
brain itself. It would not, we hold, be far from the truth to say that it is the brain 
which is inherited, the inheritance of size and shape of the skull itself is only an 
indirect cause of this more fundamental inheritance. 

If the statements made above be even approximately true, what bearing have 
they on the craniometrician’s processes? He has been seeking for some time for 
characters of the skull which are 

(а) practically uncorrelated — independent of one another ; 

(б) racially differentiated. 

The usual ethnographic characters of the anthropologists do not satisfy (a), how- 
ever much they may be of value from the standpoint of (6). Not even in the case 

* Even this reduces to *1208, if wo exclude the four exceptional cases of -Fj with Fj , Fg with 
Pg and Fg with . 

t Not yet investigated. 
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of the ii\dices usually provided by anthropologists is there independence*. From 
the data provided in the present paper, it is possible to select a system of lengths 
and indices with negligible intercorrelations. Will such lengths and indices satisfy 
(6) as well as (a) ? 

In other words, while in the individual skull there is no correlation of the 
growth of bones, will the general method of growth of the skull in different races 
be peculiar to each race ? Have morphometric characters racial value ? This can 
only be ascertained when they have been measured for additional races. But 
should they prove to have — as we anticipate — racial value, then the position of 
the Coefficient of Racial Likeness as a means of discriminating between races will 
be markedly strengthened. It needs a system of characters actually or practically 
independent. 

We wish to thank most heartily Dr G. M. Morant for continuous help during 
the preparation of this paper. 

* We may cite the following data from the present paper and that of Pearson and Davin (Biometrika, 
Vol. XVI. (1928), p. 848, Tabie V): 


Mean Correlation 
of Indices 

Morphometrioal 

Indices 

Ethnographical Indices 

r 

Number 

^ : + 0511 rt 0025 
119 

+*1468 ±-0186, + 1815 -0210 

19 19 


Thus in the case of the males the mean morphometric index correlation is only about one-third of 
the mean ethnographic (or anthropometric) index. The morphometric correlations (excluding homo- 
logous indices) include 61 cases of unilateral and 58 of cross correlations. 
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MISCELLANEA. 


(i) The Distribution of the Difference between the SSztreme Observation 
and the Sample Mean in Samples of n from a Normal UniTerse. 


By a. T. McKAY, D.So. 


Tho pur|) 08 e of the present i»aper is to discuss certain points relating to statistics formed by 
arranging tho observations <if a random sample from a normal universe in order of magnitude 
and constructing linear functions thereof. In particular, a method of determining the significance 
of the difleronce between the highest observation and the sample mean is developed. 

I. Let a;i, arranged in order of ascending magnitude, be a random sample of ?i 

from a normal universe with mean m and unit standard deviation. Consider the statistic JC 
formed by taking a linewr function of the first r observations, i.e. 


r 

X ( 1 ). 

1 

Suppose a and ^ are tho characteristic function paramet^ro of tho mean x of the 7i observa- 
tions and of the function X resj>ectively ; is tho complete distribution function of A" 
in samples of n and <r (w) is its shuidard deviation ; 3 denotes the term “characteristic 

r 

function of...”; and Then if tS denotes tho distribution surface of x and A', 

i 

3 (A') =. (<#>,„-o) (2). 

To prove this theorem consider the case in which n = 4 and r=2. For tliis wo have 


3 (.«?) = <?/■” 

J —CO 


^azhi-h {z-m)* 


dzj 


- oo 



^(a/n 4 a,/3) a:-i(x-my 
oo 

^(a/n+a.l^) w-i 
oo 


.(3). 


Changing the variable in the last integral 'f>n the right-hand side of (3) by writing 

u — — t we find 

n 

(...) (4). 

J — OO J — OO 

Proceeding similarly by changing the variables of integration in turn we derive, in reverting to 
our original symbols, 

y— OO J '-CO * J — CO J —CO 

w- 

This latter expression is in agreement with the special case of (2) and the general theorem 
follows by induction. 


Let us now seek the analytical expression for the surface S. 
written 

3 


Equation (2) may of course be 


( 6 ), 
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SO that interpreting first with respect to a and writing the corresponding variable x and the 
other variable y we have 

(7) 

“ ( 8 ), 

/ — , 1 rf* 

wlience ^ xe {y - 1 (.r - m)} (9). 

▼ SafT 

The correlation coefficient R is derived from (6) by taking logarithms, difierentiating with 
respect to a, ^ and a/3, and writing a*=^~0. Whence 

/?=>-* (lO). 

yfna- (n) 

Further, if R (y) denotes the expected value of y, thou from (6) the regression curve of y on x is 

( 11 ), 

provided the arrays in the numerator and denominator are inteiprcted independently. Hence 

^ 

y = A’ (y) + # (.r - m) (13). 

Similarly the regression curve of x on y is derived from 

B^) 




II. Let us now consider the linear statistical function formed hy subtracting the sample 
mean from the function A", i.e. 

u^{X — x) (Ifi)- 

The characteristic function for the distribution of u is to l>e found from (6) by writing — /3 for a 


and then regarding /3 as referring to u. Thus 

3^/Ca)-.«a« 3^(<^) (17), 

whence / (^^) - e ^ ^ ^ 0( a + m) (18). 

We note also from (17) that if denotes the pth semi-invariant of the distribution of u and 
Kp refers similarly to the distribution of X, then 

(Mean) = Kx —m 'j 

(Variance) -p I (19)* 

Ap=Ap, /?ses2, 3, ... j 


III. We shall now proceed to apply some of the foregoing results to the case when A’'= highest 
observation of the samide of n. Thus the r associated with equation (1) is unity. Since no 
generality will be lost by so doing, we shall take in what follows. The means and standard 

30—2 



468 


Miscellanea 


deviations of the distribution of X for various sizes of samples, which have already been given 
elsewhere*, enable us to construct the following table showing the standard deviation of 
u = {X — x) and the correlation coefficient R for X and x. 


n 

8.1>. of M 

R 

2 

•6170 

•866 

5 

•4976 

•668 

10 

-4943 

•539 

20 

•4761 

•426 

60 

•4368 

■284 

1(X) 

•4176 

•233 

200 

•3946 

•176 

600 

•3677 

•121 

1000 

•3500 

•090 


It is noteworthy that even for samples as large as 60 R is still quite moderate. 
Let us now discuss the distribution of u more closely. It is obvious firstly that 






( 20 ). 


So from (18), if {x) is the probability of a sample of n occurring in which u is less than x. 


F„(x)^e 


1 

'2a da?* 


( - r 

\s/2ir J —a 


' dxj 


.( 21 ), 


and 


’2ndjr* ^ 





Now suppose we proceed as though to expand the right-hand side of (22) by Leibnitz’s 
Theorem, i.e. 


Cl t , \ ^ -tr.va 

V2»r 


( 1- ‘ 
Ks/^rr J --a 


, - !C*/2 


dx 


)■ 


.(23), 


where «= ^ and />2 • Whence 

CXi dx.2 


Now 


X, (^L f-j '^^y- 

V 1 




.(24). 

.(26). 


Substituting (25) in (24) and making one or two obvious steps we find 


n , 

ET//\ ^ ~2 n-l*‘ 

V «-] « ’ 






■jTa + 0 -, /(n - 1)^ _ ~ 1 


) n — 1 

...(26). 


Now we note 


-(.-O' 


k 


{C-;-')-r(.”0' 


Tables for Statisticians and Biometricians, Part II. p, cxiv. 
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Thus, if ^ and x integral functions regular at the origin, 

(w 

Applying the latter theorem to (26), 

<>•>■ 

We now consider the case for n»2. To find the distribution of (whore a ?2 is the 

2S 

larger) wo must integrate 

2tr «“* (**•+*••) d^,diP^ (30) 

over the field for which 



0<(a?2— J?i) J 

Write (:r 2 - a?i)*«2«? and {^a:^-{-Xi) = 2v. Whence (30) becomes 

4e -(«>*+«»•) 

— dwdv (31 ), 

TT 

and(30«*) I (31«*). 

Whence the distribution of u is 

u>0 (32). 

This represents one-half a normal curve. We note that the characteristic function of this 
distribution is 

3/.(«) = ^ 

Further, using (29) and (32), 

^ Jo 

Other cases can be found similarly, but they are of no interest since the integrals cannot be 
evaluated simply. 

Let us now return to equation (29). It is easy to see that when x is very small 

K (*)-l‘ «» . , „>2 (33). 

Again, i^H(^)-» [ J^'ix)dx^ H e'^^dt (34), 

J * V2»r J x^Jn/in-V) 

provided /*«->! small compared with unity. But 

. /-'-'■•* <“>■ 

Whence P. {x) may be found approximately from eqtiation (34) provided the expression on the 
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right-hand side of (35) is small compared with unity. The latter is always less than (x). 
So we may conclude that (34) can always be used for determining the low percentage points ♦. 

In a given case the significance of departure of the highest observation from the mean of the 
sample is derived directly from equation (34). 

dP 

Owing to the fact that ^ is small when x is large it is clearly more accurate to determine 
Pn ipc) when x is given than to construct a table giving x for standard values of *06, ’01, etc. of 
Pn (x). We shall now consider one or two examples of the use of the statistical function u «= highest 
observation minus the mean of the sample. 

Example 1. K. Pearson t gives the capacities of 17 Moriori skulls, the highest value of which 
(1630) appears anomalous, and discusses the rejection of this observation on the basis of three 
different methods. The mean and standard deviation of the sample are respectively 1405*8 and 
97* * § 83. Thus 74 **(1630— 1405*8)/o-, where <r is the sbmdard deviation of the parent universe. Let 
us assume that this latter is closely given by the sample estimate, and that the universe is 
normal. Then i4=(1630 — 1405*8)/97*83 = 2*2917. Also n — 2*3622. Whence from equation (34) 
17 (74) ~ 17 X 0*009084 = 0-154. We should therefore conclude that 1630 was not an anomalous 
value. Applied to the same data Irwin’s testj gives a probability of 0*134. Using E. S. Pearson’s 
table § for the percentage limits of the range, a probability somewhat greater than 0*10 is 
obtained. The three tests show good agreement. 

Example 2. In the course of I'outine testing of a standard leather product of a tannery five 
parallel tests yieldcid the following values for the hide substance content of the leather specimens : 

32*44, 36*45, 39*64, 40*13, 41*09. 

The first observation appears unduly low. 

The question therefore facing the tannery chemist is whether it is worth while instituting 
inquiries as to why such an anomalous specimen has occurred. Long experience of the product 
in question has established a value of 2*226 for the standard deviation. Whence 

u = (37 *95 32*44)/2*226 = 2*475. 

Also was 2*767, whence Ps («) = 6 x 0*00283 = 0*0142. 

Wo should conclude from this that the observation is anomalous and inquiri<^s could justifiably 
be instituted. Irwin’s test applied to the same data gives a value of 0*06 for the probability, 
whilst using the range as criterion E. S. Pearson’s table gives a value of 0*06. The latter two 
tests are just about on the usually accepted boundaries of significance. Accordingly on the basis 
of these one would hesitate to regard the observation as anomalous. It is of course difl&cult to 
dcKside which test gives the surest guidance, though the author’s experience of the material in 
question loads him to believe that the value of 32*44 is definitely anomalous. 

[Note. Some readers may be interested in learning what information the P^n oriterion 
provides with regard to the above Example 2. That criterion gives equal weight to every member 

* The same kind of argument can always be applied in the case of the distribution of the highest 
observation of a sample from any parent universe. Let/{^) be the parent universe, then the distribution 
of the highest observation in a sample of n is 

Pn (^) ^nf{x) (^) 

.*. l\{x)-n j^f{x)dx, 

provided ~ {x) is small compared with unity. This will always be the case in the vicinity of the 

levels of significance which are usually employed. 

t Loc. cit. Tables, Part II. p. oviii. 

X I^oc, cit. Tables, Part II. p. 160. 

§ Biometrika^ Vol. xxrv. p. 416 (1982). 
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of the sample ; this is also true of the ^ test but not of Irwin’s or MoEAys test. The 
criterion gives the value *000 to the probability, if wo use the “established value” 2*2S6 for the 
standard deviation. Thus we infer that the outlier ought to be rejected, or that the sample has not 
been drawn at random from a normal population. On the other hand, if wa adopt the standard 
deviation 3*166 given by the sample itself, we find the probability to be *42, or no question of 
throwing out the outlier arises. But inquiring whether the standard deviation 3*165 was 
a reasonable random sample from a population of standard deviation 2*226, we find that the 
probability of a sample with its standard deviation > 3*166 is only *06, which agrees with the 
results of Irwin and E. S. Pearson^s tests. It would seem Eiccordingly that given the value 3*166 
the sample is quite probable, but that such a standard deviation verges on the improbable if the 
established value of the parental population is 2*226. Ed.] 

(11) Heredity^ mainly Human. 

By ELDON MOORE. 

Chapman and Hall, Ltd. London, 1934. Price 15«. 

The author of this volume writes for “the educated and intelligent reader who knows nothing 
of the subject, but wishes to understand its main outlines without intending to pui’sue it much 
further.” The book covers a wide field, including chapters on genes and the mechanism of 
heredity, on the endocrine glands, on physique and character, on hybrids, on inbreeding, on 
blood groups and on the inheritance of amentia and of ability ; some investigations on inheritance 
in twins are described and an interesting section is concerned with intelligence and social class ; 
the latter part of the volume provides a general review of such subjects as evolution, natural 
selection, mutation, Lamarckism, etc. 

It requires courage to set out on so ambitious a programme and the author is to be congratu- 
lated on having produced a volume which clearly portrays the intense interest, academic and 
practical, attaching t<^ the science of Gtenetics to-day; ho has succeeded in suggesting the general 
trend of progress in this field and the volume will certainly stimulate interest in the subject. 

Writing admittedly for the reader “who knows nothing of the subject” carries with it a 
special responsibility with regard to the accuracy of the facts given ; a number of statements 
are made which will arrest the critical mind of a reader who has some knowledge and may 
perhaps shake his confidence. To give one instance on pp. 69 — 70: “The endocrine constitu- 
tion... x)lays a part... in three of the commonest human ills with a genetic basis — asthma, cancer 
and tuberculosis, which are in some way related to one another..,. Cancerous i)atiouts are very 
seldom tubercular ♦ while the tuberculous are but rarely afflicted by cancer... .Tuberculosis is 
markedly rare both in asthmatic patients and in their families.” All these statements are most 
challenging and are far from being x^roven facts; the genetic basis of cancer in man has not been 
established, nor that its sufferers, as such, carry any insurance against tuberculosis. The genetic 
basis of asthma must surely be regarded as standing upon precarious ground until wo know more 
of the incidence of asthma in the general population and the incidence of all the other ailergioal 
diseases which may or may not replace it. 

On what evidence does the author state that children who will later develoi) Huntington’s 
chorea very frequently have im usually small heads? Other similar criticisms might be given but 
we can recommend a volume which contains so much interesting suggestion, and only regret the 
occasional lack of caution in the too definite presentation of unproven facts. 

* [As a matter of fact cancer of the lungs is often found to develop on the site of healed tubercular 
lesions. See Earl Pearson, ** Report on certain Cancer Statistics,** Archives of the Middlesex Hospital, 
pp. 8 — 12. Nor can we yet assert that the tendency to cancer is certainly hereditary. See iHd. pp. 6 — 8; 
bnt compare Brenda Stoessiger, ** Inheritance of Duration of Life and Cause of Death,** Annals of 
Eugenics, Vol. v, ** Cancer,** pp. 148 — 160, especially p. 150. See also Percy Stocks, ** A Cooperative Study 
of 450 Cancer Patients and an equal Number of Controls,** ibid. Vol. v. pp. 274 — 278 and 280. Ed,] 
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